SUBSTITUTE SEQUENCE LISTING 

<110> Bihain, Bernard 

Bougueleret, Lydie
Yen-Potin, Frances



<120> LSR Receptor, Its Activity, Its Cloning and Its Application To 
The Diagnosis Prevention And/or Treatment of Obesity and 
Related Risks or Complications 

<130> GENSET.035APC

<150> FR 97/10088 
<151> 1997-08-06 

<150> FR 98/05032 
<151> 1998-04-22 

<160> 41 

<170> Patent.pm

<210> 1 
<211> 2097 
<212> DNA 

<213> Rattus norvegicus 
<400> 1 

accgctcacc aggtcagttg tccccggaaa gccgaaggca tgagcttcgc ccaagttctt 60
tttatgggtt agaactcctc cagagcgggg gaaaaaggac ttggaatagg ggcgggacgg 120
agcacgcacc cttctccgcc ttggttctcg ccgcgccccc tactctcggg atacttggga 180

ggggacgcgc gggcaccgtc gctgctagac ggccgcg atg gcg ccg gcg gcc ggc 235
Met Ala Pro Ala Ala Gly
1 5

gcg tgt gct ggg gcg cct gac tcc cac cca gct acc gtg gtc ttc gtg 283
Ala Cys Ala Gly Ala Pro Asp Ser His Pro Ala Thr Val Val Phe Val
10 15 20

tgt ctc ttt ctc atc att ttc tgc cca gac cct gcc agt gcc atc cag 331
Cys Leu Phe Leu Ile Ile Phe Cys Pro Asp Pro Ala Ser Ala Ile Gln
25 30 35

gtg act gtg tct gac ccc tac cac gta gtg atc ctg ttc cag cca gtg 379
Val Thr Val Ser Asp Pro Tyr His Val Val Ile Leu Phe Gln Pro Val
40 45 50

acc ctg ccc tgc acc tat cag atg agc aac act ctc aca gtc ccc atc 427
Thr Leu Pro Cys Thr Tyr Gln Met Ser Asn Thr Leu Thr Val Pro Ile
55 60 65 70

gtg atc tgg aag tac aag tca ttc tgc cgg gac cgt att gcc gat gcc 475
Val Ile Trp Lys Tyr Lys Ser Phe Cys Arg Asp Arg Ile Ala Asp Ala
75 80 85

ttc tct cct gcc agt gtg gac aac cag cta aat gcc cag ttg gca gct 523
Phe Ser Pro Ala Ser Val Asp Asn Gln Leu Asn Ala Gln Leu Ala Ala
90 95 100

ggc aac ccc ggc tac aac ccc tat gtg gag tgc cag gac agt gta cgc 571
Gly Asn Pro Gly Tyr Asn Pro Tyr Val Glu Cys Gln Asp Ser Val Arg
105 110 115

act gtc agg gtg gtg gcc acc aaa cag ggc aat gcg gtg acc ctg gga 619
Thr Val Arg Val Val Ala Thr Lys Gln Gly Asn Ala Val Thr Leu Gly
120 125 130

gac tac tac caa ggc agg agg atc acc ata aca gga aat gct gac ctg 667
Asp Tyr Tyr Gln Gly Arg Arg Ile Thr Ile Thr Gly Asn Ala Asp Leu
135 140 145 150

acc ttc gag cag aca gcc tgg gga gac agt gga gtg tat tac tgc tct 715
Thr Phe Glu Gln Thr Ala Trp Gly Asp Ser Gly Val Tyr Tyr Cys Ser
155 160 165

gtg gtc tcg gcc caa gat ctg gat gga aac aac gag gcg tac gca gag 763
Val Val Ser Ala Gln Asp Leu Asp Gly Asn Asn Glu Ala Tyr Ala Glu
170 175 180

ctc atc gtc ctt ggc agg acc tca gag gcc cct gag ctc cta cct ggt 811
Leu Ile Val Leu Gly Arg Thr Ser Glu Ala Pro Glu Leu Leu Pro Gly
185 190 195

ttt cgg gcg ggg ccc ttg gaa gat tgg ctc ttt gtg gtc gtg gtc tgc 859
Phe Arg Ala Gly Pro Leu Glu Asp Trp Leu Phe Val Val Val Val Cys
200 205 210

ctg gcg agc ctc ctc ctc ttc ctc ctc ctg ggc atc tgc tgg tgc cag 907
Leu Ala Ser Leu Leu Leu Phe Leu Leu Leu Gly Ile Cys Trp Cys Gln
215 220 225 230

tgc tgt cct cac acc tgc tgc tgc tat gtc cga tgt ccc tgc tgc cca 955
Cys Cys Pro His Thr Cys Cys Cys Tyr Val Arg Cys Pro Cys Cys Pro
235 240 245

gac aag tgc tgt tgc cct gag gct ctt tat gct gct ggc aaa gca gcc 1003
Asp Lys Cys Cys Cys Pro Glu Ala Leu Tyr Ala Ala Gly Lys Ala Ala
250 255 260

acc tca ggt gtc ccg agc atc tat gcc ccc agc atc tat acc cac ctc 1051
Thr Ser Gly Val Pro Ser Ile Tyr Ala Pro Ser Ile Tyr Thr His Leu
265 270 275

tca cct gcc aag acc cca cca cct ccg cct gcc atg att ccc atg ggc 1099
Ser Pro Ala Lys Thr Pro Pro Pro Pro Pro Ala Met Ile Pro Met Gly
280 285 290

cct ccc tat ggg tac cct gga gac ttt gac aga cat agc tca gtt ggt 1147
Pro Pro Tyr Gly Tyr Pro Gly Asp Phe Asp Arg His Ser Ser Val Gly
295 300 305 310

ggc cac agc tcc caa gta ccc ctg ctg cgt gac gtg gat ggc agt gta
Gly His Ser Ser Gln Val Pro Leu Leu Arg Asp Val Asp Gly Ser Val
315 320 325

tct tca gaa gta cga agt ggc tac agg atc cag gct aac cag caa gat
Ser Ser Glu Val Arg Ser Gly Tyr Arg Ile Gln Ala Asn Gln Gln Asp
330 335 340

gac tcc atg agg gtc cta tac tat atg gag aaa gag cta gcc aac ttt
Asp Ser Met Arg Val Leu Tyr Tyr Met Glu Lys Glu Leu Ala Asn Phe
345 350 355

gac cct tcc cga cct ggc cct ccc aat ggc aga gtg gaa cgg gcc atg
Asp Pro Ser Arg Pro Gly Pro Pro Asn Gly Arg Val Glu Arg Ala Met
360 365 370

agt gaa gta acc tcc ctc cat gaa gat gac tgg cga tcg agg cct tcc
Ser Glu Val Thr Ser Leu His Glu Asp Asp Trp Arg Ser Arg Pro Ser
375 380 385 390

agg gct cct gcc ctc acc ccc atc agg gat gag gag tgg aat cgc cac
Arg Ala Pro Ala Leu Thr Pro Ile Arg Asp Glu Glu Trp Asn Arg His
395 400 405

tcc cca cag agt ccc aga aca tgg gag cag gaa ccc ctt caa gaa caa
Ser Pro Gln Ser Pro Arg Thr Trp Glu Gln Glu Pro Leu Gln Glu Gln
410 415 420

cca agg ggt ggt tgg ggg tct gga cgc cct cgg gcc cgc tct gtg gat
Pro Arg Gly Gly Trp Gly Ser Gly Arg Pro Arg Ala Arg Ser Val Asp
425 430 435

gct cta gat gat atc aac cgg cct ggc tcc act gaa tca gga cgg tct
Ala Leu Asp Asp Ile Asn Arg Pro Gly Ser Thr Glu Ser Gly Arg Ser
440 445 450

tct ccc cca agt agt gga cgg aga gga cgg gcc tat gca cct cca aga
Ser Pro Pro Ser Ser Gly Arg Arg Gly Arg Ala Tyr Ala Pro Pro Arg
455 460 465 470

agt cgc agc cgg gat gac ctc tat gac ccg gac gat cct agg gac ttg
Ser Arg Ser Arg Asp Asp Leu Tyr Asp Pro Asp Asp Pro Arg Asp Leu
475 480 485

cca cat tcc cga gat ccc cac tat tat gac gac atc agg tct aga gat
Pro His Ser Arg Asp Pro His Tyr Tyr Asp Asp Ile Arg Ser Arg Asp
490 495 500

cca cgt gct gac ccc aga tcc cgt cag cga tcc cga gat cct cgg gat
Pro Arg Ala Asp Pro Arg Ser Arg Gln Arg Ser Arg Asp Pro Arg Asp
505 510 515

gct ggc ttc agg tca agg gac cct cag tat gat ggg cga cta tta gaa
Ala Gly Phe Arg Ser Arg Asp Pro Gln Tyr Asp Gly Arg Leu Leu Glu
520 525 530

gag gct tta aag aaa aag ggg tcg ggc gag aga agg agg gtt tac agg 1867
Glu Ala Leu Lys Lys Lys Gly Ser Gly Glu Arg Arg Arg Val Tyr Arg
535 540 545 550

gag gaa gaa gag gaa gag gag ggc caa tac ccc cca gca cct cca cct 1915
Glu Glu Glu Glu Glu Glu Glu Gly Gln Tyr Pro Pro Ala Pro Pro Pro
555 560 565

tac tca gag act gac tcg cag gcc tca cgg gag agg agg ctg aaa aag 1963
Tyr Ser Glu Thr Asp Ser Gln Ala Ser Arg Glu Arg Arg Leu Lys Lys
570 575 580

aat ttg gcc ctg agt cgg gaa agt tta gtc gtc tga tccacgtttt 2009
Asn Leu Ala Leu Ser Arg Glu Ser Leu Val Val *
585 590

gtatgtagct tttgtacttt ttttttaatt ggaatcaata ttgatgaaac ttcaagccta 2069
ataaaatgtc taatcacaaa aaaaaaaa 2097



<210> 2 
<211> 593 
<212> PRT 

<213> Rattus norvegicus 
<400> 2 

Met Ala Pro Ala Ala Gly Ala Cys Ala Gly Ala Pro Asp Ser His Pro
1 5 10 15

Ala Thr Val Val Phe Val Cys Leu Phe Leu Ile Ile Phe Cys Pro Asp
20 25 30

Pro Ala Ser Ala Ile Gln Val Thr Val Ser Asp Pro Tyr His Val Val
35 40 45

Ile Leu Phe Gln Pro Val Thr Leu Pro Cys Thr Tyr Gln Met Ser Asn
50 55 60

Thr Leu Thr Val Pro Ile Val Ile Trp Lys Tyr Lys Ser Phe Cys Arg
65 70 75 80

Asp Arg Ile Ala Asp Ala Phe Ser Pro Ala Ser Val Asp Asn Gln Leu
85 90 95

Asn Ala Gln Leu Ala Ala Gly Asn Pro Gly Tyr Asn Pro Tyr Val Glu
100 105 110

Cys Gln Asp Ser Val Arg Thr Val Arg Val Val Ala Thr Lys Gln Gly
115 120 125

Asn Ala Val Thr Leu Gly Asp Tyr Tyr Gln Gly Arg Arg Ile Thr Ile
130 135 140

Thr Gly Asn Ala Asp Leu Thr Phe Glu Gln Thr Ala Trp Gly Asp Ser
145 150 155 160



Gly Val Tyr Tyr Cys Ser Val Val Ser Ala Gln Asp Leu Asp Gly Asn
165 170 175

Asn Glu Ala Tyr Ala Glu Leu Ile Val Leu Gly Arg Thr Ser Glu Ala
180 185 190

Pro Glu Leu Leu Pro Gly Phe Arg Ala Gly Pro Leu Glu Asp Trp Leu
195 200 205

Phe Val Val Val Val Cys Leu Ala Ser Leu Leu Leu Phe Leu Leu Leu
210 215 220

Gly Ile Cys Trp Cys Gln Cys Cys Pro His Thr Cys Cys Cys Tyr Val
225 230 235 240

Arg Cys Pro Cys Cys Pro Asp Lys Cys Cys Cys Pro Glu Ala Leu Tyr
245 250 255

Ala Ala Gly Lys Ala Ala Thr Ser Gly Val Pro Ser Ile Tyr Ala Pro
260 265 270

Ser Ile Tyr Thr His Leu Ser Pro Ala Lys Thr Pro Pro Pro Pro Pro
275 280 285

Ala Met Ile Pro Met Gly Pro Pro Tyr Gly Tyr Pro Gly Asp Phe Asp
290 295 300

Arg His Ser Ser Val Gly Gly His Ser Ser Gln Val Pro Leu Leu Arg
305 310 315 320

Asp Val Asp Gly Ser Val Ser Ser Glu Val Arg Ser Gly Tyr Arg Ile
325 330 335

Gln Ala Asn Gln Gln Asp Asp Ser Met Arg Val Leu Tyr Tyr Met Glu
340 345 350

Lys Glu Leu Ala Asn Phe Asp Pro Ser Arg Pro Gly Pro Pro Asn Gly
355 360 365

Arg Val Glu Arg Ala Met Ser Glu Val Thr Ser Leu His Glu Asp Asp
370 375 380

Trp Arg Ser Arg Pro Ser Arg Ala Pro Ala Leu Thr Pro Ile Arg Asp
385 390 395 400

Glu Glu Trp Asn Arg His Ser Pro Gln Ser Pro Arg Thr Trp Glu Gln
405 410 415

Glu Pro Leu Gln Glu Gln Pro Arg Gly Gly Trp Gly Ser Gly Arg Pro
420 425 430

Arg Ala Arg Ser Val Asp Ala Leu Asp Asp Ile Asn Arg Pro Gly Ser
435 440 445

Thr Glu Ser Gly Arg Ser Ser Pro Pro Ser Ser Gly Arg Arg Gly Arg
450 455 460

Ala Tyr Ala Pro Pro Arg Ser Arg Ser Arg Asp Asp Leu Tyr Asp Pro
465 470 475 480

Asp Asp Pro Arg Asp Leu Pro His Ser Arg Asp Pro His Tyr Tyr Asp
485 490 495

Asp Ile Arg Ser Arg Asp Pro Arg Ala Asp Pro Arg Ser Arg Gln Arg
500 505 510

Ser Arg Asp Pro Arg Asp Ala Gly Phe Arg Ser Arg Asp Pro Gln Tyr
515 520 525

Asp Gly Arg Leu Leu Glu Glu Ala Leu Lys Lys Lys Gly Ser Gly Glu
530 535 540

Arg Arg Arg Val Tyr Arg Glu Glu Glu Glu Glu Glu Glu Gly Gln Tyr
545 550 555 560

Pro Pro Ala Pro Pro Pro Tyr Ser Glu Thr Asp Ser Gln Ala Ser Arg
565 570 575

Glu Arg Arg Leu Lys Lys Asn Leu Ala Leu Ser Arg Glu Ser Leu Val
580 585 590

Val

<210> 3 
<211> 2040 
<212> DNA 

<213> Rattus norvegicus 
<400> 3 

accgctcacc aggtcagttg tccccggaaa gccgaaggca tgagcttcgc ccaagttctt 60
tttatgggtt agaactcctc cagagcgggg gaaaaaggac ttggaatagg ggcgggacgg 120
agcacgcacc cttctccgcc ttggttctcg ccgcgccccc tactctcggg atacttggga 180

ggggacgcgc gggcaccgtc gctgctagac ggccgcg atg gcg ccg gcg gcc ggc 235
Met Ala Pro Ala Ala Gly
1 5

gcg tgt gct ggg gcg cct gac tcc cac cca gct acc gtg gtc ttc gtg 283
Ala Cys Ala Gly Ala Pro Asp Ser His Pro Ala Thr Val Val Phe Val
10 15 20

tgt ctc ttt ctc atc att ttc tgc cca gac cct gcc agt gcc atc cag 331
Cys Leu Phe Leu Ile Ile Phe Cys Pro Asp Pro Ala Ser Ala Ile Gln
25 30 35

gtg act gtg tct gac ccc tac cac gta gtg atc ctg ttc cag cca gtg 379
Val Thr Val Ser Asp Pro Tyr His Val Val Ile Leu Phe Gln Pro Val
40 45 50

acc ctg ccc tgc acc tat cag atg agc aac act ctc aca gtc ccc atc 427
Thr Leu Pro Cys Thr Tyr Gln Met Ser Asn Thr Leu Thr Val Pro Ile
55 60 65 70

gtg atc tgg aag tac aag tca ttc tgc cgg gac cgt att gcc gat gcc 475
Val Ile Trp Lys Tyr Lys Ser Phe Cys Arg Asp Arg Ile Ala Asp Ala
75 80 85

ttc tct cct gcc agt gtg gac aac cag cta aat gcc cag ttg gca gct 523
Phe Ser Pro Ala Ser Val Asp Asn Gln Leu Asn Ala Gln Leu Ala Ala
90 95 100

ggc aac ccc ggc tac aac ccc tat gtg gag tgc cag gac agt gta cgc 571
Gly Asn Pro Gly Tyr Asn Pro Tyr Val Glu Cys Gln Asp Ser Val Arg
105 110 115

act gtc agg gtg gtg gcc acc aaa cag ggc aat gcg gtg acc ctg gga 619
Thr Val Arg Val Val Ala Thr Lys Gln Gly Asn Ala Val Thr Leu Gly
120 125 130

gac tac tac caa ggc agg agg atc acc ata aca gga aat gct gac ctg 667
Asp Tyr Tyr Gln Gly Arg Arg Ile Thr Ile Thr Gly Asn Ala Asp Leu
135 140 145 150

acc ttc gag cag aca gcc tgg gga gac agt gga gtg tat tac tgc tct 715
Thr Phe Glu Gln Thr Ala Trp Gly Asp Ser Gly Val Tyr Tyr Cys Ser
155 160 165

gtg gtc tcg gcc caa gat ctg gat gga aac aac gag gcg tac gca gag 763
Val Val Ser Ala Gln Asp Leu Asp Gly Asn Asn Glu Ala Tyr Ala Glu
170 175 180

ctc atc gtc ctt gat tgg ctc ttt gtg gtc gtg gtc tgc ctg gcg agc 811
Leu Ile Val Leu Asp Trp Leu Phe Val Val Val Val Cys Leu Ala Ser
185 190 195

ctc ctc ctc ttc ctc ctc ctg ggc atc tgc tgg tgc cag tgc tgt cct 859
Leu Leu Leu Phe Leu Leu Leu Gly Ile Cys Trp Cys Gln Cys Cys Pro
200 205 210

cac acc tgc tgc tgc tat gtc cga tgt ccc tgc tgc cca gac aag tgc 907
His Thr Cys Cys Cys Tyr Val Arg Cys Pro Cys Cys Pro Asp Lys Cys
215 220 225 230

tgt tgc cct gag gct ctt tat gct gct ggc aaa gca gcc acc tca ggt 955
Cys Cys Pro Glu Ala Leu Tyr Ala Ala Gly Lys Ala Ala Thr Ser Gly
235 240 245

gtc ccg agc atc tat gcc ccc agc atc tat acc cac ctc tca cct gcc 1003
Val Pro Ser Ile Tyr Ala Pro Ser Ile Tyr Thr His Leu Ser Pro Ala
250 255 260

aag acc cca cca cct ccg cct gcc atg att ccc atg ggc cct ccc tat 1051
Lys Thr Pro Pro Pro Pro Pro Ala Met Ile Pro Met Gly Pro Pro Tyr
265 270 275

ggg tac cct gga gac ttt gac aga cat agc tca gtt ggt ggc cac agc 1099
Gly Tyr Pro Gly Asp Phe Asp Arg His Ser Ser Val Gly Gly His Ser
280 285 290

tcc caa gta ccc ctg ctg cgt gac gtg gat ggc agt gta tct tca gaa 1147
Ser Gln Val Pro Leu Leu Arg Asp Val Asp Gly Ser Val Ser Ser Glu
295 300 305 310

gta cga agt ggc tac agg atc cag gct aac cag caa gat gac tcc atg 1195
Val Arg Ser Gly Tyr Arg Ile Gln Ala Asn Gln Gln Asp Asp Ser Met
315 320 325

agg gtc cta tac tat atg gag aaa gag cta gcc aac ttt gac cct tcc 1243
Arg Val Leu Tyr Tyr Met Glu Lys Glu Leu Ala Asn Phe Asp Pro Ser
330 335 340

cga cct ggc cct ccc aat ggc aga gtg gaa cgg gcc atg agt gaa gta 1291
Arg Pro Gly Pro Pro Asn Gly Arg Val Glu Arg Ala Met Ser Glu Val
345 350 355

acc tcc ctc cat gaa gat gac tgg cga tcg agg cct tcc agg gct cct 1339
Thr Ser Leu His Glu Asp Asp Trp Arg Ser Arg Pro Ser Arg Ala Pro
360 365 370

gcc ctc acc ccc atc agg gat gag gag tgg aat cgc cac tcc cca cag 1387
Ala Leu Thr Pro Ile Arg Asp Glu Glu Trp Asn Arg His Ser Pro Gln
375 380 385 390

agt ccc aga aca tgg gag cag gaa ccc ctt caa gaa caa cca agg ggt 1435
Ser Pro Arg Thr Trp Glu Gln Glu Pro Leu Gln Glu Gln Pro Arg Gly
395 400 405

ggt tgg ggg tct gga cgc cct cgg gcc cgc tct gtg gat gct cta gat 1483
Gly Trp Gly Ser Gly Arg Pro Arg Ala Arg Ser Val Asp Ala Leu Asp
410 415 420

gat atc aac cgg cct ggc tcc act gaa tca gga cgg tct tct ccc cca 1531
Asp Ile Asn Arg Pro Gly Ser Thr Glu Ser Gly Arg Ser Ser Pro Pro
425 430 435

agt agt gga cgg aga gga cgg gcc tat gca cct cca aga agt cgc agc 1579
Ser Ser Gly Arg Arg Gly Arg Ala Tyr Ala Pro Pro Arg Ser Arg Ser
440 445 450

cgg gat gac ctc tat gac ccg gac gat cct agg gac ttg cca cat tcc 1627
Arg Asp Asp Leu Tyr Asp Pro Asp Asp Pro Arg Asp Leu Pro His Ser
455 460 465 470

cga gat ccc cac tat tat gac gac atc agg tct aga gat cca cgt gct 1675
Arg Asp Pro His Tyr Tyr Asp Asp Ile Arg Ser Arg Asp Pro Arg Ala
475 480 485

gac ccc aga tcc cgt cag cga tcc cga gat cct cgg gat gct ggc ttc 1723
Asp Pro Arg Ser Arg Gln Arg Ser Arg Asp Pro Arg Asp Ala Gly Phe
490 495 500

agg tca agg gac cct cag tat gat ggg cga cta tta gaa gag gct tta 1771
Arg Ser Arg Asp Pro Gln Tyr Asp Gly Arg Leu Leu Glu Glu Ala Leu
505 510 515

aag aaa aag ggg tcg ggc gag aga agg agg gtt tac agg gag gaa gaa
Lys Lys Lys Gly Ser Gly Glu Arg Arg Arg Val Tyr Arg Glu Glu Glu
520 525 530

gag gaa gag gag ggc caa tac ccc cca gca cct cca cct tac tca gag
Glu Glu Glu Glu Gly Gln Tyr Pro Pro Ala Pro Pro Pro Tyr Ser Glu
535 540 545 550

act gac tcg cag gcc tca cgg gag agg agg ctg aaa aag aat ttg gcc
Thr Asp Ser Gln Ala Ser Arg Glu Arg Arg Leu Lys Lys Asn Leu Ala
555 560 565

ctg agt cgg gaa agt tta gtc gtc tga tccacgtttt gtatgtagct
Leu Ser Arg Glu Ser Leu Val Val *
570

tttgtacttt ttttttaatt ggaatcaata ttgatgaaac ttcaagccta ataaaatgtc
taatcacaaa aaaaaaaa



<210> 4 
<211> 574 
<212> PRT 

<213> Rattus norvegicus 
<400> 4 

Met Ala Pro Ala Ala Gly Ala Cys Ala Gly Ala Pro Asp Ser His Pro
1 5 10 15

Ala Thr Val Val Phe Val Cys Leu Phe Leu Ile Ile Phe Cys Pro Asp
20 25 30

Pro Ala Ser Ala Ile Gln Val Thr Val Ser Asp Pro Tyr His Val Val
35 40 45

Ile Leu Phe Gln Pro Val Thr Leu Pro Cys Thr Tyr Gln Met Ser Asn
50 55 60

Thr Leu Thr Val Pro Ile Val Ile Trp Lys Tyr Lys Ser Phe Cys Arg
65 70 75 80

Asp Arg Ile Ala Asp Ala Phe Ser Pro Ala Ser Val Asp Asn Gln Leu
85 90 95

Asn Ala Gln Leu Ala Ala Gly Asn Pro Gly Tyr Asn Pro Tyr Val Glu
100 105 110

Cys Gln Asp Ser Val Arg Thr Val Arg Val Val Ala Thr Lys Gln Gly
115 120 125

Asn Ala Val Thr Leu Gly Asp Tyr Tyr Gln Gly Arg Arg Ile Thr Ile
130 135 140

Thr Gly Asn Ala Asp Leu Thr Phe Glu Gln Thr Ala Trp Gly Asp Ser
145 150 155 160



Gly Val Tyr Tyr Cys Ser Val Val Ser Ala Gln Asp Leu Asp Gly Asn
165 170 175

Asn Glu Ala Tyr Ala Glu Leu Ile Val Leu Asp Trp Leu Phe Val Val
180 185 190

Val Val Cys Leu Ala Ser Leu Leu Leu Phe Leu Leu Leu Gly Ile Cys
195 200 205

Trp Cys Gln Cys Cys Pro His Thr Cys Cys Cys Tyr Val Arg Cys Pro
210 215 220

Cys Cys Pro Asp Lys Cys Cys Cys Pro Glu Ala Leu Tyr Ala Ala Gly
225 230 235 240

Lys Ala Ala Thr Ser Gly Val Pro Ser Ile Tyr Ala Pro Ser Ile Tyr
245 250 255

Thr His Leu Ser Pro Ala Lys Thr Pro Pro Pro Pro Pro Ala Met Ile
260 265 270

Pro Met Gly Pro Pro Tyr Gly Tyr Pro Gly Asp Phe Asp Arg His Ser
275 280 285

Ser Val Gly Gly His Ser Ser Gln Val Pro Leu Leu Arg Asp Val Asp
290 295 300

Gly Ser Val Ser Ser Glu Val Arg Ser Gly Tyr Arg Ile Gln Ala Asn
305 310 315 320

Gln Gln Asp Asp Ser Met Arg Val Leu Tyr Tyr Met Glu Lys Glu Leu
325 330 335

Ala Asn Phe Asp Pro Ser Arg Pro Gly Pro Pro Asn Gly Arg Val Glu
340 345 350

Arg Ala Met Ser Glu Val Thr Ser Leu His Glu Asp Asp Trp Arg Ser
355 360 365

Arg Pro Ser Arg Ala Pro Ala Leu Thr Pro Ile Arg Asp Glu Glu Trp
370 375 380

Asn Arg His Ser Pro Gln Ser Pro Arg Thr Trp Glu Gln Glu Pro Leu
385 390 395 400

Gln Glu Gln Pro Arg Gly Gly Trp Gly Ser Gly Arg Pro Arg Ala Arg
405 410 415

Ser Val Asp Ala Leu Asp Asp Ile Asn Arg Pro Gly Ser Thr Glu Ser
420 425 430

Gly Arg Ser Ser Pro Pro Ser Ser Gly Arg Arg Gly Arg Ala Tyr Ala
435 440 445

Pro Pro Arg Ser Arg Ser Arg Asp Asp Leu Tyr Asp Pro Asp Asp Pro
450 455 460

Arg Asp Leu Pro His Ser Arg Asp Pro His Tyr Tyr Asp Asp Ile Arg
465 470 475 480

Ser Arg Asp Pro Arg Ala Asp Pro Arg Ser Arg Gln Arg Ser Arg Asp
485 490 495

Pro Arg Asp Ala Gly Phe Arg Ser Arg Asp Pro Gln Tyr Asp Gly Arg
500 505 510

Leu Leu Glu Glu Ala Leu Lys Lys Lys Gly Ser Gly Glu Arg Arg Arg
515 520 525

Val Tyr Arg Glu Glu Glu Glu Glu Glu Glu Gly Gln Tyr Pro Pro Ala
530 535 540

Pro Pro Pro Tyr Ser Glu Thr Asp Ser Gln Ala Ser Arg Glu Arg Arg
545 550 555 560

Leu Lys Lys Asn Leu Ala Leu Ser Arg Glu Ser Leu Val Val
565 570



<210> 5 
<211> 1893 
<212> DNA 

<213> Rattus norvegicus 
<400> 5 

accgctcacc aggtcagttg tccccggaaa gccgaaggca tgagcttcgc ccaagttctt
tttatgggtt agaactcctc cagagcgggg gaaaaaggac ttggaatagg ggcgggacgg
agcacgcacc cttctccgcc ttggttctcg ccgcgccccc tactctcggg atacttggga

ggggacgcgc gggcaccgtc gctgctagac ggccgcg atg gcg ccg gcg gcc ggc
Met Ala Pro Ala Ala Gly
1 5

gcg tgt gct ggg gcg cct gac tcc cac cca gct acc gtg gtc ttc gtg
Ala Cys Ala Gly Ala Pro Asp Ser His Pro Ala Thr Val Val Phe Val
10 15 20

tgt ctc ttt ctc atc att ttc tgc cca gac cct gcc agt gcc atc cag
Cys Leu Phe Leu Ile Ile Phe Cys Pro Asp Pro Ala Ser Ala Ile Gln
25 30 35

gtg act gtg tct gac ccc tac cac gta gtg atc ctg ttc cag cca gtg
Val Thr Val Ser Asp Pro Tyr His Val Val Ile Leu Phe Gln Pro Val
40 45 50

acc ctg ccc tgc acc tat cag atg agc aac act ctc aca gtc ccc atc
Thr Leu Pro Cys Thr Tyr Gln Met Ser Asn Thr Leu Thr Val Pro Ile
55 60 65 70

gtg atc tgg aag tac aag tca ttc tgc cgg gac cgt att gcc gat gcc
Val Ile Trp Lys Tyr Lys Ser Phe Cys Arg Asp Arg Ile Ala Asp Ala
75 80 85

ttc tct cct gcc agt gtg gac aac cag cta aat gcc cag ttg gca gct
Phe Ser Pro Ala Ser Val Asp Asn Gln Leu Asn Ala Gln Leu Ala Ala
90 95 100

ggc aac ccc ggc tac aac ccc tat gtg gag tgc cag gac agt gta cgc
Gly Asn Pro Gly Tyr Asn Pro Tyr Val Glu Cys Gln Asp Ser Val Arg
105 110 115

act gtc agg gtg gtg gcc acc aaa cag ggc aat gcg gtg acc ctg gga
Thr Val Arg Val Val Ala Thr Lys Gln Gly Asn Ala Val Thr Leu Gly
120 125 130

gac tac tac caa ggc agg agg atc acc ata aca gga aat gct gac ctg
Asp Tyr Tyr Gln Gly Arg Arg Ile Thr Ile Thr Gly Asn Ala Asp Leu
135 140 145 150

acc ttc gag cag aca gcc tgg gga gac agt gga gtg tat tac tgc tct
Thr Phe Glu Gln Thr Ala Trp Gly Asp Ser Gly Val Tyr Tyr Cys Ser
155 160 165

gtg gtc tcg gcc caa gat ctg gat gga aac aac gag gcg tac gca gag
Val Val Ser Ala Gln Asp Leu Asp Gly Asn Asn Glu Ala Tyr Ala Glu
170 175 180

ctc atc gtc ctt gtt tat gct gct ggc aaa gca gcc acc tca ggt gtc
Leu Ile Val Leu Val Tyr Ala Ala Gly Lys Ala Ala Thr Ser Gly Val
185 190 195

ccg agc atc tat gcc ccc agc atc tat acc cac ctc tca cct gcc aag
Pro Ser Ile Tyr Ala Pro Ser Ile Tyr Thr His Leu Ser Pro Ala Lys
200 205 210

acc cca cca cct ccg cct gcc atg att ccc atg ggc cct ccc tat ggg
Thr Pro Pro Pro Pro Pro Ala Met Ile Pro Met Gly Pro Pro Tyr Gly
215 220 225 230

tac cct gga gac ttt gac aga cat agc tca gtt ggt ggc cac agc tcc
Tyr Pro Gly Asp Phe Asp Arg His Ser Ser Val Gly Gly His Ser Ser
235 240 245

caa gta ccc ctg ctg cgt gac gtg gat ggc agt gta tct tca gaa gta
Gln Val Pro Leu Leu Arg Asp Val Asp Gly Ser Val Ser Ser Glu Val
250 255 260

cga agt ggc tac agg atc cag gct aac cag caa gat gac tcc atg agg
Arg Ser Gly Tyr Arg Ile Gln Ala Asn Gln Gln Asp Asp Ser Met Arg
265 270 275

gtc cta tac tat atg gag aaa gag cta gcc aac ttt gac cct tcc cga
Val Leu Tyr Tyr Met Glu Lys Glu Leu Ala Asn Phe Asp Pro Ser Arg
280 285 290

cct ggc cct ccc aat ggc aga gtg gaa cgg gcc atg agt gaa gta acc
Pro Gly Pro Pro Asn Gly Arg Val Glu Arg Ala Met Ser Glu Val Thr
295 300 305 310

tcc ctc cat gaa gat gac tgg cga tcg agg cct tcc agg gct cct gcc
Ser Leu His Glu Asp Asp Trp Arg Ser Arg Pro Ser Arg Ala Pro Ala
315 320 325

ctc acc ccc atc agg gat gag gag tgg aat cgc cac tcc cca cag agt
Leu Thr Pro Ile Arg Asp Glu Glu Trp Asn Arg His Ser Pro Gln Ser
330 335 340

ccc aga aca tgg gag cag gaa ccc ctt caa gaa caa cca agg ggt ggt
Pro Arg Thr Trp Glu Gln Glu Pro Leu Gln Glu Gln Pro Arg Gly Gly
345 350 355

tgg ggg tct gga cgc cct cgg gcc cgc tct gtg gat gct cta gat gat
Trp Gly Ser Gly Arg Pro Arg Ala Arg Ser Val Asp Ala Leu Asp Asp
360 365 370

atc aac cgg cct ggc tcc act gaa tca gga cgg tct tct ccc cca agt
Ile Asn Arg Pro Gly Ser Thr Glu Ser Gly Arg Ser Ser Pro Pro Ser
375 380 385 390

agt gga cgg aga gga cgg gcc tat gca cct cca aga agt cgc agc cgg
Ser Gly Arg Arg Gly Arg Ala Tyr Ala Pro Pro Arg Ser Arg Ser Arg
395 400 405

gat gac ctc tat gac ccg gac gat cct agg gac ttg cca cat tcc cga
Asp Asp Leu Tyr Asp Pro Asp Asp Pro Arg Asp Leu Pro His Ser Arg
410 415 420

gat ccc cac tat tat gac gac atc agg tct aga gat cca cgt gct gac
Asp Pro His Tyr Tyr Asp Asp Ile Arg Ser Arg Asp Pro Arg Ala Asp
425 430 435

ccc aga tcc cgt cag cga tcc cga gat cct cgg gat gct ggc ttc agg
Pro Arg Ser Arg Gln Arg Ser Arg Asp Pro Arg Asp Ala Gly Phe Arg
440 445 450

tca agg gac cct cag tat gat ggg cga cta tta gaa gag gct tta aag
Ser Arg Asp Pro Gln Tyr Asp Gly Arg Leu Leu Glu Glu Ala Leu Lys
455 460 465 470

aaa aag ggg tcg ggc gag aga agg agg gtt tac agg gag gaa gaa gag
Lys Lys Gly Ser Gly Glu Arg Arg Arg Val Tyr Arg Glu Glu Glu Glu
475 480 485

gaa gag gag ggc caa tac ccc cca gca cct cca cct tac tca gag act
Glu Glu Glu Gly Gln Tyr Pro Pro Ala Pro Pro Pro Tyr Ser Glu Thr
490 495 500

gac tcg cag gcc tca cgg gag agg agg ctg aaa aag aat ttg gcc ctg
Asp Ser Gln Ala Ser Arg Glu Arg Arg Leu Lys Lys Asn Leu Ala Leu
505 510 515

agt cgg gaa agt tta gtc gtc tga tccacgtttt gtatgtagct tttgtacttt
Ser Arg Glu Ser Leu Val Val *
520 525

ttttttaatt ggaatcaata ttgatgaaac ttcaagccta ataaaatgtc taatcacaaa 1885
aaaaaaaa 1893



<210> 6 
<211> 525 
<212> PRT 

<213> Rattus norvegicus 



<400> 6 

Met Ala Pro Ala Ala Gly Ala Cys Ala Gly Ala Pro Asp Ser His Pro
1 5 10 15

Ala Thr Val Val Phe Val Cys Leu Phe Leu Ile Ile Phe Cys Pro Asp
20 25 30

Pro Ala Ser Ala Ile Gln Val Thr Val Ser Asp Pro Tyr His Val Val
35 40 45

Ile Leu Phe Gln Pro Val Thr Leu Pro Cys Thr Tyr Gln Met Ser Asn
50 55 60

Thr Leu Thr Val Pro Ile Val Ile Trp Lys Tyr Lys Ser Phe Cys Arg
65 70 75 80

Asp Arg Ile Ala Asp Ala Phe Ser Pro Ala Ser Val Asp Asn Gln Leu
85 90 95

Asn Ala Gln Leu Ala Ala Gly Asn Pro Gly Tyr Asn Pro Tyr Val Glu
100 105 110

Cys Gln Asp Ser Val Arg Thr Val Arg Val Val Ala Thr Lys Gln Gly
115 120 125

Asn Ala Val Thr Leu Gly Asp Tyr Tyr Gln Gly Arg Arg Ile Thr Ile
130 135 140

Thr Gly Asn Ala Asp Leu Thr Phe Glu Gln Thr Ala Trp Gly Asp Ser
145 150 155 160

Gly Val Tyr Tyr Cys Ser Val Val Ser Ala Gln Asp Leu Asp Gly Asn
165 170 175

Asn Glu Ala Tyr Ala Glu Leu Ile Val Leu Val Tyr Ala Ala Gly Lys
180 185 190

Ala Ala Thr Ser Gly Val Pro Ser Ile Tyr Ala Pro Ser Ile Tyr Thr
195 200 205

His Leu Ser Pro Ala Lys Thr Pro Pro Pro Pro Pro Ala Met Ile Pro
210 215 220

Met Gly Pro Pro Tyr Gly Tyr Pro Gly Asp Phe Asp Arg His Ser Ser
225 230 235 240

Val Gly Gly His Ser Ser Gln Val Pro Leu Leu Arg Asp Val Asp Gly
245 250 255

Ser Val Ser Ser Glu Val Arg Ser Gly Tyr Arg Ile Gln Ala Asn Gln
260 265 270

Gln Asp Asp Ser Met Arg Val Leu Tyr Tyr Met Glu Lys Glu Leu Ala
275 280 285

Asn Phe Asp Pro Ser Arg Pro Gly Pro Pro Asn Gly Arg Val Glu Arg
290 295 300

Ala Met Ser Glu Val Thr Ser Leu His Glu Asp Asp Trp Arg Ser Arg
305 310 315 320

Pro Ser Arg Ala Pro Ala Leu Thr Pro Ile Arg Asp Glu Glu Trp Asn
325 330 335

Arg His Ser Pro Gln Ser Pro Arg Thr Trp Glu Gln Glu Pro Leu Gln
340 345 350

Glu Gln Pro Arg Gly Gly Trp Gly Ser Gly Arg Pro Arg Ala Arg Ser
355 360 365

Val Asp Ala Leu Asp Asp Ile Asn Arg Pro Gly Ser Thr Glu Ser Gly
370 375 380

Arg Ser Ser Pro Pro Ser Ser Gly Arg Arg Gly Arg Ala Tyr Ala Pro
385 390 395 400

Pro Arg Ser Arg Ser Arg Asp Asp Leu Tyr Asp Pro Asp Asp Pro Arg
405 410 415

Asp Leu Pro His Ser Arg Asp Pro His Tyr Tyr Asp Asp Ile Arg Ser
420 425 430

Arg Asp Pro Arg Ala Asp Pro Arg Ser Arg Gln Arg Ser Arg Asp Pro
435 440 445

Arg Asp Ala Gly Phe Arg Ser Arg Asp Pro Gln Tyr Asp Gly Arg Leu
450 455 460

Leu Glu Glu Ala Leu Lys Lys Lys Gly Ser Gly Glu Arg Arg Arg Val
465 470 475 480

Tyr Arg Glu Glu Glu Glu Glu Glu Glu Gly Gln Tyr Pro Pro Ala Pro
485 490 495

Pro Pro Tyr Ser Glu Thr Asp Ser Gln Ala Ser Arg Glu Arg Arg Leu
500 505 510

Lys Lys Asn Leu Ala Leu Ser Arg Glu Ser Leu Val Val
515 520 525



<210> 7 
<211> 2158 
<212> DNA 

<213> Homo sapiens 

<220> 

<221> Misc_Feature 
<222> 1259..1261

<223> Potential splicing site AAG 

<221> Misc_Feature 
<222> 1657 

<223> Potential insertion of a AGG 
<400> 7 

tggagtgtgg ctcggaggac cgcggcgggt caagcacctt tctcccccat atctgaaagc 60

atgccctttg tccacgtcgt ttacgctcat taaaacttcc aga atg caa cag gac 115
Met Gln Gln Asp
1

gga ctt gga gta ggg aca agg aac gga agt ggg aag ggg agg agc gtg 163
Gly Leu Gly Val Gly Thr Arg Asn Gly Ser Gly Lys Gly Arg Ser Val
5 10 15 20

cac ccc tcc tgg cct tgg tgc gcg ccg cgc ccc cta agg tac ttt gga 211
His Pro Ser Trp Pro Trp Cys Ala Pro Arg Pro Leu Arg Tyr Phe Gly
25 30 35

agg gac gcg cgg gcc aga cgc gcc cag acg gcc gcg atg gcg ctg ttg 259
Arg Asp Ala Arg Ala Arg Arg Ala Gln Thr Ala Ala Met Ala Leu Leu
40 45 50

gcc ggc ggg ctc tcc aga ggg ctg ggc tcc cac ccg gcc gcc gca ggc 307
Ala Gly Gly Leu Ser Arg Gly Leu Gly Ser His Pro Ala Ala Ala Gly
55 60 65

cgg gac gcg gtc gtc ttc gtg tgg ctt ctg ctt agc acc tgg tgc aca 355
Arg Asp Ala Val Val Phe Val Trp Leu Leu Leu Ser Thr Trp Cys Thr
70 75 80

gct cct gcc agg gcc atc cag gtg acc gtg tcc aac ccc tac cac gtg 403
Ala Pro Ala Arg Ala Ile Gln Val Thr Val Ser Asn Pro Tyr His Val
85 90 95 100

gtg atc ctc ttc cag cct gtg acc ctg ccc tgt acc tac cag atg acc 451
Val Ile Leu Phe Gln Pro Val Thr Leu Pro Cys Thr Tyr Gln Met Thr
105 110 115

tcg acc ccc acg caa ccc atc gtc atc tgg aag tac aag tct ttc tgc 499
Ser Thr Pro Thr Gln Pro Ile Val Ile Trp Lys Tyr Lys Ser Phe Cys
120 125 130

cgg gac cgc atc gcc gat gcc ttc tcc ccg gcc agc gtc gac aac cag 547
Arg Asp Arg Ile Ala Asp Ala Phe Ser Pro Ala Ser Val Asp Asn Gln
135 140 145

ctc aat gcc cag ctg gca gcc ggg aac cca ggc tac aac ccc tac gtt 595
Leu Asn Ala Gln Leu Ala Ala Gly Asn Pro Gly Tyr Asn Pro Tyr Val
150 155 160

gag tgc cag gac agc gtg cgc acc gtc agg gtc gtg gcc acc aag cag 643
Glu Cys Gln Asp Ser Val Arg Thr Val Arg Val Val Ala Thr Lys Gln
165 170 175 180

ggc aac gct gtg acc ctg gga gat tac tac cag ggc cgg agg att acc 691
Gly Asn Ala Val Thr Leu Gly Asp Tyr Tyr Gln Gly Arg Arg Ile Thr
185 190 195

atc acc gga aat gct gac ctg acc ttt gac cag acg gcg tgg ggg gac 739
Ile Thr Gly Asn Ala Asp Leu Thr Phe Asp Gln Thr Ala Trp Gly Asp
200 205 210

agt ggt gtg tat tac tgc tcc gtg gtc tca gcc cag gac ctc cag ggg 787
Ser Gly Val Tyr Tyr Cys Ser Val Val Ser Ala Gln Asp Leu Gln Gly
215 220 225

aac aat gag gcc tac gca gag ctc atc gtc ctt ggg agg acc tca ggg 835
Asn Asn Glu Ala Tyr Ala Glu Leu Ile Val Leu Gly Arg Thr Ser Gly
230 235 240

gtg gct gag ctc tta cct ggt ttt cag gcg ggg ccc ata gaa gac tgg 883
Val Ala Glu Leu Leu Pro Gly Phe Gln Ala Gly Pro Ile Glu Asp Trp
245 250 255 260

ctc ttc gtg gtt gtg gta tgc ctg gct gcc ttc ctc atc ttc ctc ctc 931
Leu Phe Val Val Val Val Cys Leu Ala Ala Phe Leu Ile Phe Leu Leu
265 270 275

ctg ggc atc tgc tgg tgc cag tgc tgc ccg cac act tgc tgc tgc tac 979
Leu Gly Ile Cys Trp Cys Gln Cys Cys Pro His Thr Cys Cys Cys Tyr
280 285 290

gtc agg tgc ccc tgc tgc cca gac aag tgc tgc tgc ccc gag gcc ctg 1027
Val Arg Cys Pro Cys Cys Pro Asp Lys Cys Cys Cys Pro Glu Ala Leu
295 300 305

tat gcc gcc ggc aaa gca gcc acc tca ggt gtt ccc agc att tat gcc 1075
Tyr Ala Ala Gly Lys Ala Ala Thr Ser Gly Val Pro Ser Ile Tyr Ala
310 315 320

ccc agc acc tat gcc cac ctg tct ccc gcc aag acc cca ccc cca cca 1123
Pro Ser Thr Tyr Ala His Leu Ser Pro Ala Lys Thr Pro Pro Pro Pro
325 330 335 340

gct atg att ccc atg ggc cct gcc tac aac ggg tac cct gga gga tac 1171
Ala Met Ile Pro Met Gly Pro Ala Tyr Asn Gly Tyr Pro Gly Gly Tyr
345 350 355

cct gga gac gtt gac agg agt agc tca gct ggt ggc caa ggc tcc tat 1219
Pro Gly Asp Val Asp Arg Ser Ser Ser Ala Gly Gly Gln Gly Ser Tyr
360 365 370



gta ccc ctg ctt cgg gac acg gac agc agt gtg gcc tct gaa gtc cgc 1267
Val Pro Leu Leu Arg Asp Thr Asp Ser Ser Val Ala Ser Glu Val Arg
375 380 385

agt ggc tac agg att cag gcc agc cag cag gac gac tcc atg cgg gtc 1315
Ser Gly Tyr Arg Ile Gln Ala Ser Gln Gln Asp Asp Ser Met Arg Val
390 395 400

ctg tac tac atg gag aag gag ctg gcc aac ttc gac cct tct cga cct 1363
Leu Tyr Tyr Met Glu Lys Glu Leu Ala Asn Phe Asp Pro Ser Arg Pro
405 410 415 420

ggc ccc ccc agt ggc cgt gtg gag cgg gcc atg agt gaa gtc acc tcc 1411
Gly Pro Pro Ser Gly Arg Val Glu Arg Ala Met Ser Glu Val Thr Ser
425 430 435

ctc cac gag gac gac tgg cga tct cgg cct tcc cgg ggc cct gcc ctc 1459
Leu His Glu Asp Asp Trp Arg Ser Arg Pro Ser Arg Gly Pro Ala Leu
440 445 450

acc ccg atc cgg gat gag gag tgg ggt ggc cac tcc ccc cgg agt ccc 1507
Thr Pro Ile Arg Asp Glu Glu Trp Gly Gly His Ser Pro Arg Ser Pro
455 460 465

agg gga tgg gac cag gag ccc gcc agg gag cag gca ggc ggg ggc tgg 1555
Arg Gly Trp Asp Gln Glu Pro Ala Arg Glu Gln Ala Gly Gly Gly Trp
470 475 480

cgg gcc agg cgg ccc cgg gcc cgc tcc gtg gac gcc ctg gac gac ctc 1603
Arg Ala Arg Arg Pro Arg Ala Arg Ser Val Asp Ala Leu Asp Asp Leu
485 490 495 500

acc ccg ccg agc acc gcc gag tca ggg agc agg tct ccc acg agt aat 1651
Thr Pro Pro Ser Thr Ala Glu Ser Gly Ser Arg Ser Pro Thr Ser Asn
505 510 515

ggt ggg aga agc cgg gcc tac atg ccc ccg cgg agc cgc agc cgg gac 1699
Gly Gly Arg Ser Arg Ala Tyr Met Pro Pro Arg Ser Arg Ser Arg Asp
520 525 530

gac ctc tat gac caa gac gac tcg agg gac ttc cca cgc tcc cgg gac 1747
Asp Leu Tyr Asp Gln Asp Asp Ser Arg Asp Phe Pro Arg Ser Arg Asp
535 540 545

ccc cac tac gac gac ttc agg tct cgg gag cgc cct cct gcc gac ccc 1795
Pro His Tyr Asp Asp Phe Arg Ser Arg Glu Arg Pro Pro Ala Asp Pro
550 555 560

agg tcc cac cac cac cgt acc cgg gac cct cgg gac aac ggc tcc agg 1843
Arg Ser His His His Arg Thr Arg Asp Pro Arg Asp Asn Gly Ser Arg
565 570 575 580

tcc ggg gac ctc ccc tat gat ggg cgg cta ctg gag gag gct gtg agg 1891
Ser Gly Asp Leu Pro Tyr Asp Gly Arg Leu Leu Glu Glu Ala Val Arg
585 590 595

aag aag ggg tcg gag gag agg agg aga ccc cac aag gag gag gag gaa 1939
Lys Lys Gly Ser Glu Glu Arg Arg Arg Pro His Lys Glu Glu Glu Glu
600 605 610



gag gcc tac tac ccg ccc gcg ccg ccc ccg tac tcg gag acc gac tcg 1987
Glu Ala Tyr Tyr Pro Pro Ala Pro Pro Pro Tyr Ser Glu Thr Asp Ser
615 620 625

cag gcg tcc cga gag cgc agg ctc aag aag aac ttg gcc ctg agt cgg 2035
Gln Ala Ser Arg Glu Arg Arg Leu Lys Lys Asn Leu Ala Leu Ser Arg
630 635 640

gaa agt tta gtc gtc tga tctgacgttt tctacgtagc ttttgtattt 2083
Glu Ser Leu Val Val *
645

ttttttttaa tttgaaggaa cactgatgaa gccctgccat acccctcccg agtctaataa 2143
aacgtataat cacaa 2158



<210> 8 
<211> 649 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> Misc_Feature 
<222> 386 

<223> Potential deletion of a Glu 



<221> Misc_Feature 
<222> 518 

<223> Potential insertion of a Arg 
<400> 8 

Met Gln Gln Asp Gly Leu Gly Val Gly Thr Arg Asn Gly Ser Gly Lys
1 5 10 15

Gly Arg Ser Val His Pro Ser Trp Pro Trp Cys Ala Pro Arg Pro Leu
20 25 30

Arg Tyr Phe Gly Arg Asp Ala Arg Ala Arg Arg Ala Gln Thr Ala Ala
35 40 45

Met Ala Leu Leu Ala Gly Gly Leu Ser Arg Gly Leu Gly Ser His Pro
50 55 60

Ala Ala Ala Gly Arg Asp Ala Val Val Phe Val Trp Leu Leu Leu Ser
65 70 75 80

Thr Trp Cys Thr Ala Pro Ala Arg Ala Ile Gln Val Thr Val Ser Asn
85 90 95

Pro Tyr His Val Val Ile Leu Phe Gln Pro Val Thr Leu Pro Cys Thr
100 105 110

Tyr Gln Met Thr Ser Thr Pro Thr Gln Pro Ile Val Ile Trp Lys Tyr
115 120 125

Lys Ser Phe Cys Arg Asp Arg Ile Ala Asp Ala Phe Ser Pro Ala Ser
130 135 140

Val Asp Asn Gln Leu Asn Ala Gln Leu Ala Ala Gly Asn Pro Gly Tyr
145 150 155 160

Asn Pro Tyr Val Glu Cys Gln Asp Ser Val Arg Thr Val Arg Val Val
165 170 175

Ala Thr Lys Gln Gly Asn Ala Val Thr Leu Gly Asp Tyr Tyr Gln Gly
180 185 190

Arg Arg Ile Thr Ile Thr Gly Asn Ala Asp Leu Thr Phe Asp Gln Thr
195 200 205

Ala Trp Gly Asp Ser Gly Val Tyr Tyr Cys Ser Val Val Ser Ala Gln
210 215 220

Asp Leu Gln Gly Asn Asn Glu Ala Tyr Ala Glu Leu Ile Val Leu Gly
225 230 235 240

Arg Thr Ser Gly Val Ala Glu Leu Leu Pro Gly Phe Gln Ala Gly Pro
245 250 255

Ile Glu Asp Trp Leu Phe Val Val Val Val Cys Leu Ala Ala Phe Leu
260 265 270

Ile Phe Leu Leu Leu Gly Ile Cys Trp Cys Gln Cys Cys Pro His Thr
275 280 285

Cys Cys Cys Tyr Val Arg Cys Pro Cys Cys Pro Asp Lys Cys Cys Cys
290 295 300

Pro Glu Ala Leu Tyr Ala Ala Gly Lys Ala Ala Thr Ser Gly Val Pro
305 310 315 320

Ser Ile Tyr Ala Pro Ser Thr Tyr Ala His Leu Ser Pro Ala Lys Thr
325 330 335

Pro Pro Pro Pro Ala Met Ile Pro Met Gly Pro Ala Tyr Asn Gly Tyr
340 345 350

Pro Gly Gly Tyr Pro Gly Asp Val Asp Arg Ser Ser Ser Ala Gly Gly
355 360 365

Gln Gly Ser Tyr Val Pro Leu Leu Arg Asp Thr Asp Ser Ser Val Ala
370 375 380

Ser Glu Val Arg Ser Gly Tyr Arg Ile Gln Ala Ser Gln Gln Asp Asp
385 390 395 400

Ser Met Arg Val Leu Tyr Tyr Met Glu Lys Glu Leu Ala Asn Phe Asp
405 410 415

Pro Ser Arg Pro Gly Pro Pro Ser Gly Arg Val Glu Arg Ala Met Ser
420 425 430

Glu Val Thr Ser Leu His Glu Asp Asp Trp Arg Ser Arg Pro Ser Arg
435 440 445

Gly Pro Ala Leu Thr Pro Ile Arg Asp Glu Glu Trp Gly Gly His Ser
450 455 460

Pro Arg Ser Pro Arg Gly Trp Asp Gln Glu Pro Ala Arg Glu Gln Ala
465 470 475 480

Gly Gly Gly Trp Arg Ala Arg Arg Pro Arg Ala Arg Ser Val Asp Ala
485 490 495

Leu Asp Asp Leu Thr Pro Pro Ser Thr Ala Glu Ser Gly Ser Arg Ser
500 505 510

Pro Thr Ser Asn Gly Gly Arg Ser Arg Ala Tyr Met Pro Pro Arg Ser
515 520 525

Arg Ser Arg Asp Asp Leu Tyr Asp Gln Asp Asp Ser Arg Asp Phe Pro
530 535 540

Arg Ser Arg Asp Pro His Tyr Asp Asp Phe Arg Ser Arg Glu Arg Pro
545 550 555 560

Pro Ala Asp Pro Arg Ser His His His Arg Thr Arg Asp Pro Arg Asp
565 570 575

Asn Gly Ser Arg Ser Gly Asp Leu Pro Tyr Asp Gly Arg Leu Leu Glu
580 585 590

Glu Ala Val Arg Lys Lys Gly Ser Glu Glu Arg Arg Arg Pro His Lys
595 600 605

Glu Glu Glu Glu Glu Ala Tyr Tyr Pro Pro Ala Pro Pro Pro Tyr Ser
610 615 620

Glu Thr Asp Ser Gln Ala Ser Arg Glu Arg Arg Leu Lys Lys Asn Leu
625 630 635 640

Ala Leu Ser Arg Glu Ser Leu Val Val
645



<210> 9 
<211> 2101 
<212> DNA 

<213> Homo sapiens 
<400> 9

tggagtgtgg ctcggaggac cgcggcgggt caagcacctt tctcccccat atctgaaagc 60 

atgccctttg tccacgtcgt ttacgctcat taaaacttcc aga atg caa cag gac 115
Met Gln Gln Asp
1

gga ctt gga gta ggg aca agg aac gga agt ggg aag ggg agg agc gtg 163
Gly Leu Gly Val Gly Thr Arg Asn Gly Ser Gly Lys Gly Arg Ser Val
5 10 15 20

cac ccc tcc tgg cct tgg tgc gcg ccg cgc ccc cta agg tac ttt gga 211
His Pro Ser Trp Pro Trp Cys Ala Pro Arg Pro Leu Arg Tyr Phe Gly
25 30 35

agg gac gcg cgg gcc aga cgc gcc cag acg gcc gcg atg gcg ctg ttg 259
Arg Asp Ala Arg Ala Arg Arg Ala Gln Thr Ala Ala Met Ala Leu Leu
40 45 50



gcc ggc ggg ctc tcc aga ggg ctg ggc tcc cac ccg gcc gcc gca ggc 307
Ala Gly Gly Leu Ser Arg Gly Leu Gly Ser His Pro Ala Ala Ala Gly
55 60 65

cgg gac gcg gtc gtc ttc gtg tgg ctt ctg ctt agc acc tgg tgc aca 355
Arg Asp Ala Val Val Phe Val Trp Leu Leu Leu Ser Thr Trp Cys Thr
70 75 80

gct cct gcc agg gcc atc cag gtg acc gtg tcc aac ccc tac cac gtg 403
Ala Pro Ala Arg Ala Ile Gln Val Thr Val Ser Asn Pro Tyr His Val
85 90 95 100

gtg atc ctc ttc cag cct gtg acc ctg ccc tgt acc tac cag atg acc 451
Val Ile Leu Phe Gln Pro Val Thr Leu Pro Cys Thr Tyr Gln Met Thr
105 110 115

tcg acc ccc acg caa ccc atc gtc atc tgg aag tac aag tct ttc tgc 499
Ser Thr Pro Thr Gln Pro Ile Val Ile Trp Lys Tyr Lys Ser Phe Cys
120 125 130

cgg gac cgc atc gcc gat gcc ttc tcc ccg gcc agc gtc gac aac cag 547
Arg Asp Arg Ile Ala Asp Ala Phe Ser Pro Ala Ser Val Asp Asn Gln
135 140 145

ctc aat gcc cag ctg gca gcc ggg aac cca ggc tac aac ccc tac gtt 595
Leu Asn Ala Gln Leu Ala Ala Gly Asn Pro Gly Tyr Asn Pro Tyr Val
150 155 160

gag tgc cag gac agc gtg cgc acc gtc agg gtc gtg gcc acc aag cag 643
Glu Cys Gln Asp Ser Val Arg Thr Val Arg Val Val Ala Thr Lys Gln
165 170 175 180

ggc aac gct gtg acc ctg gga gat tac tac cag ggc cgg agg att acc 691
Gly Asn Ala Val Thr Leu Gly Asp Tyr Tyr Gln Gly Arg Arg Ile Thr
185 190 195

atc acc gga aat gct gac ctg acc ttt gac cag acg gcg tgg ggg gac 739
Ile Thr Gly Asn Ala Asp Leu Thr Phe Asp Gln Thr Ala Trp Gly Asp
200 205 210

agt ggt gtg tat tac tgc tcc gtg gtc tca gcc cag gac ctc cag ggg 787
Ser Gly Val Tyr Tyr Cys Ser Val Val Ser Ala Gln Asp Leu Gln Gly
215 220 225

aac aat gag gcc tac gca gag ctc atc gtc ctt gac tgg ctc ttc gtg 835
Asn Asn Glu Ala Tyr Ala Glu Leu Ile Val Leu Asp Trp Leu Phe Val
230 235 240

gtt gtg gta tgc ctg gct gcc ttc ctc atc ttc ctc ctc ctg ggc atc 883
Val Val Val Cys Leu Ala Ala Phe Leu Ile Phe Leu Leu Leu Gly Ile
245 250 255 260

tgc tgg tgc cag tgc tgc ccg cac act tgc tgc tgc tac gtc agg tgc 931
Cys Trp Cys Gln Cys Cys Pro His Thr Cys Cys Cys Tyr Val Arg Cys
265 270 275

ccc tgc tgc cca gac aag tgc tgc tgc ccc gag gcc ctg tat gcc gcc 979
Pro Cys Cys Pro Asp Lys Cys Cys Cys Pro Glu Ala Leu Tyr Ala Ala
280 285 290

ggc aaa gca gcc acc tca ggt gtt ccc agc att tat gcc ccc agc acc 1027
Gly Lys Ala Ala Thr Ser Gly Val Pro Ser Ile Tyr Ala Pro Ser Thr
295 300 305

tat gcc cac ctg tct ccc gcc aag acc cca ccc cca cca gct atg att 1075
Tyr Ala His Leu Ser Pro Ala Lys Thr Pro Pro Pro Pro Ala Met Ile
310 315 320



ccc atg ggc cct gcc tac aac ggg tac cct gga gga tac cct gga gac 1123
Pro Met Gly Pro Ala Tyr Asn Gly Tyr Pro Gly Gly Tyr Pro Gly Asp
325 330 335 340

gtt gac agg agt agc tca gct ggt ggc caa ggc tcc tat gta ccc ctg 1171
Val Asp Arg Ser Ser Ser Ala Gly Gly Gln Gly Ser Tyr Val Pro Leu
345 350 355

ctt cgg gac acg gac agc agt gtg gcc tct gaa gtc cgc agt ggc tac 1219
Leu Arg Asp Thr Asp Ser Ser Val Ala Ser Glu Val Arg Ser Gly Tyr
360 365 370

agg att cag gcc agc cag cag gac gac tcc atg cgg gtc ctg tac tac 1267
Arg Ile Gln Ala Ser Gln Gln Asp Asp Ser Met Arg Val Leu Tyr Tyr
375 380 385

atg gag aag gag ctg gcc aac ttc gac cct tct cga cct ggc ccc ccc 1315
Met Glu Lys Glu Leu Ala Asn Phe Asp Pro Ser Arg Pro Gly Pro Pro
390 395 400

agt ggc cgt gtg gag cgg gcc atg agt gaa gtc acc tcc ctc cac gag 1363
Ser Gly Arg Val Glu Arg Ala Met Ser Glu Val Thr Ser Leu His Glu
405 410 415 420

gac gac tgg cga tct cgg cct tcc cgg ggc cct gcc ctc acc ccg atc 1411
Asp Asp Trp Arg Ser Arg Pro Ser Arg Gly Pro Ala Leu Thr Pro Ile
425 430 435

cgg gat gag gag tgg ggt ggc cac tcc ccc cgg agt ccc agg gga tgg 1459
Arg Asp Glu Glu Trp Gly Gly His Ser Pro Arg Ser Pro Arg Gly Trp
440 445 450

gac cag gag ccc gcc agg gag cag gca ggc ggg ggc tgg cgg gcc agg 1507
Asp Gln Glu Pro Ala Arg Glu Gln Ala Gly Gly Gly Trp Arg Ala Arg
455 460 465

cgg ccc cgg gcc cgc tcc gtg gac gcc ctg gac gac ctc acc ccg ccg 1555
Arg Pro Arg Ala Arg Ser Val Asp Ala Leu Asp Asp Leu Thr Pro Pro
470 475 480

agc acc gcc gag tca ggg agc agg tct ccc acg agt aat ggt ggg aga 1603
Ser Thr Ala Glu Ser Gly Ser Arg Ser Pro Thr Ser Asn Gly Gly Arg
485 490 495 500

agc cgg gcc tac atg ccc ccg cgg agc cgc agc cgg gac gac ctc tat 1651
Ser Arg Ala Tyr Met Pro Pro Arg Ser Arg Ser Arg Asp Asp Leu Tyr
505 510 515

gac caa gac gac tcg agg gac ttc cca cgc tcc cgg gac ccc cac tac 1699
Asp Gln Asp Asp Ser Arg Asp Phe Pro Arg Ser Arg Asp Pro His Tyr
520 525 530

gac gac ttc agg tct cgg gag cgc cct cct gcc gac ccc agg tcc cac 1747
Asp Asp Phe Arg Ser Arg Glu Arg Pro Pro Ala Asp Pro Arg Ser His
535 540 545

cac cac cgt acc cgg gac cct cgg gac aac ggc tcc agg tcc ggg gac 1795
His His Arg Thr Arg Asp Pro Arg Asp Asn Gly Ser Arg Ser Gly Asp
550 555 560

ctc ccc tat gat ggg cgg cta ctg gag gag gct gtg agg aag aag ggg 1843
Leu Pro Tyr Asp Gly Arg Leu Leu Glu Glu Ala Val Arg Lys Lys Gly
565 570 575 580

tcg gag gag agg agg aga ccc cac aag gag gag gag gaa gag gcc tac 1891
Ser Glu Glu Arg Arg Arg Pro His Lys Glu Glu Glu Glu Glu Ala Tyr
585 590 595

tac ccg ccc gcg ccg ccc ccg tac tcg gag acc gac tcg cag gcg tcc 1939
Tyr Pro Pro Ala Pro Pro Pro Tyr Ser Glu Thr Asp Ser Gln Ala Ser
600 605 610

cga gag cgc agg ctc aag aag aac ttg gcc ctg agt cgg gaa agt tta 1987
Arg Glu Arg Arg Leu Lys Lys Asn Leu Ala Leu Ser Arg Glu Ser Leu
615 620 625

gtc gtc tga tctgacgttt tctacgtagc ttttgtattt ttttttttaa 2036
Val Val *
630

tttgaaggaa cactgatgaa gccctgccat acccctcccg agtctaataa aacgtataat 2096
cacaa 2101



<210> 10 

<211> 630 

<212> PRT 

<213> Homo sapiens 



<400> 10 

Met Gln Gln Asp Gly Leu Gly Val Gly Thr Arg Asn Gly Ser Gly Lys
1 5 10 15

Gly Arg Ser Val His Pro Ser Trp Pro Trp Cys Ala Pro Arg Pro Leu
20 25 30

Arg Tyr Phe Gly Arg Asp Ala Arg Ala Arg Arg Ala Gln Thr Ala Ala
35 40 45

Met Ala Leu Leu Ala Gly Gly Leu Ser Arg Gly Leu Gly Ser His Pro
50 55 60

Ala Ala Ala Gly Arg Asp Ala Val Val Phe Val Trp Leu Leu Leu Ser
65 70 75 80

Thr Trp Cys Thr Ala Pro Ala Arg Ala Ile Gln Val Thr Val Ser Asn
85 90 95

Pro Tyr His Val Val Ile Leu Phe Gln Pro Val Thr Leu Pro Cys Thr
100 105 110

Tyr Gln Met Thr Ser Thr Pro Thr Gln Pro Ile Val Ile Trp Lys Tyr
115 120 125

Lys Ser Phe Cys Arg Asp Arg Ile Ala Asp Ala Phe Ser Pro Ala Ser
130 135 140

Val Asp Asn Gln Leu Asn Ala Gln Leu Ala Ala Gly Asn Pro Gly Tyr
145 150 155 160

Asn Pro Tyr Val Glu Cys Gln Asp Ser Val Arg Thr Val Arg Val Val
165 170 175

Ala Thr Lys Gln Gly Asn Ala Val Thr Leu Gly Asp Tyr Tyr Gln Gly
180 185 190

Arg Arg Ile Thr Ile Thr Gly Asn Ala Asp Leu Thr Phe Asp Gln Thr
195 200 205

Ala Trp Gly Asp Ser Gly Val Tyr Tyr Cys Ser Val Val Ser Ala Gln
210 215 220

Asp Leu Gln Gly Asn Asn Glu Ala Tyr Ala Glu Leu Ile Val Leu Asp
225 230 235 240

Trp Leu Phe Val Val Val Val Cys Leu Ala Ala Phe Leu Ile Phe Leu
245 250 255

Leu Leu Gly Ile Cys Trp Cys Gln Cys Cys Pro His Thr Cys Cys Cys
260 265 270

Tyr Val Arg Cys Pro Cys Cys Pro Asp Lys Cys Cys Cys Pro Glu Ala
275 280 285

Leu Tyr Ala Ala Gly Lys Ala Ala Thr Ser Gly Val Pro Ser Ile Tyr
290 295 300

Ala Pro Ser Thr Tyr Ala His Leu Ser Pro Ala Lys Thr Pro Pro Pro
305 310 315 320

Pro Ala Met Ile Pro Met Gly Pro Ala Tyr Asn Gly Tyr Pro Gly Gly
325 330 335

Tyr Pro Gly Asp Val Asp Arg Ser Ser Ser Ala Gly Gly Gln Gly Ser
340 345 350

Tyr Val Pro Leu Leu Arg Asp Thr Asp Ser Ser Val Ala Ser Glu Val
355 360 365

Arg Ser Gly Tyr Arg Ile Gln Ala Ser Gln Gln Asp Asp Ser Met Arg
370 375 380

Val Leu Tyr Tyr Met Glu Lys Glu Leu Ala Asn Phe Asp Pro Ser Arg
385 390 395 400

Pro Gly Pro Pro Ser Gly Arg Val Glu Arg Ala Met Ser Glu Val Thr
405 410 415

Ser Leu His Glu Asp Asp Trp Arg Ser Arg Pro Ser Arg Gly Pro Ala
420 425 430

Leu Thr Pro Ile Arg Asp Glu Glu Trp Gly Gly His Ser Pro Arg Ser
435 440 445

Pro Arg Gly Trp Asp Gln Glu Pro Ala Arg Glu Gln Ala Gly Gly Gly
450 455 460

Trp Arg Ala Arg Arg Pro Arg Ala Arg Ser Val Asp Ala Leu Asp Asp
465 470 475 480

Leu Thr Pro Pro Ser Thr Ala Glu Ser Gly Ser Arg Ser Pro Thr Ser
485 490 495

Asn Gly Gly Arg Ser Arg Ala Tyr Met Pro Pro Arg Ser Arg Ser Arg
500 505 510

Asp Asp Leu Tyr Asp Gln Asp Asp Ser Arg Asp Phe Pro Arg Ser Arg
515 520 525

Asp Pro His Tyr Asp Asp Phe Arg Ser Arg Glu Arg Pro Pro Ala Asp
530 535 540

Pro Arg Ser His His His Arg Thr Arg Asp Pro Arg Asp Asn Gly Ser
545 550 555 560

Arg Ser Gly Asp Leu Pro Tyr Asp Gly Arg Leu Leu Glu Glu Ala Val
565 570 575

Arg Lys Lys Gly Ser Glu Glu Arg Arg Arg Pro His Lys Glu Glu Glu
580 585 590

Glu Glu Ala Tyr Tyr Pro Pro Ala Pro Pro Pro Tyr Ser Glu Thr Asp
595 600 605

Ser Gln Ala Ser Arg Glu Arg Arg Leu Lys Lys Asn Leu Ala Leu Ser
610 615 620

Arg Glu Ser Leu Val Val
625 630



<210> 11 

<211> 1954 

<212> DNA 

<213> Homo sapiens 



<400> 11 

tggagtgtgg ctcggaggac cgcggcgggt caagcacctt tctcccccat atctgaaagc 

atgccctttg tccacgtcgt ttacgctcat taaaacttcc aga atg caa cag gac 

Met Gin Gin Asp 
1 



gga ctt gga gta ggg aca agg aac gga agt ggg aag ggg agg agc gtg
Gly Leu Gly Val Gly Thr Arg Asn Gly Ser Gly Lys Gly Arg Ser Val
5 10 15 20

cac ccc tcc tgg cct tgg tgc gcg ccg cgc ccc cta agg tac ttt gga
His Pro Ser Trp Pro Trp Cys Ala Pro Arg Pro Leu Arg Tyr Phe Gly
25 30 35

agg gac gcg cgg gcc aga cgc gcc cag acg gcc gcg atg gcg ctg ttg
Arg Asp Ala Arg Ala Arg Arg Ala Gln Thr Ala Ala Met Ala Leu Leu
40 45 50

gcc ggc ggg ctc tcc aga ggg ctg ggc tcc cac ccg gcc gcc gca ggc
Ala Gly Gly Leu Ser Arg Gly Leu Gly Ser His Pro Ala Ala Ala Gly
55 60 65

cgg gac gcg gtc gtc ttc gtg tgg ctt ctg ctt agc acc tgg tgc aca
Arg Asp Ala Val Val Phe Val Trp Leu Leu Leu Ser Thr Trp Cys Thr
70 75 80

gct cct gcc agg gcc atc cag gtg acc gtg tcc aac ccc tac cac gtg
Ala Pro Ala Arg Ala Ile Gln Val Thr Val Ser Asn Pro Tyr His Val
85 90 95 100

gtg atc ctc ttc cag cct gtg acc ctg ccc tgt acc tac cag atg acc
Val Ile Leu Phe Gln Pro Val Thr Leu Pro Cys Thr Tyr Gln Met Thr
105 110 115



tcg acc ccc acg caa ccc atc gtc atc tgg aag tac aag tct ttc tgc
Ser Thr Pro Thr Gln Pro Ile Val Ile Trp Lys Tyr Lys Ser Phe Cys
120 125 130

cgg gac cgc atc gcc gat gcc ttc tcc ccg gcc agc gtc gac aac cag
Arg Asp Arg Ile Ala Asp Ala Phe Ser Pro Ala Ser Val Asp Asn Gln
135 140 145

ctc aat gcc cag ctg gca gcc ggg aac cca ggc tac aac ccc tac gtt
Leu Asn Ala Gln Leu Ala Ala Gly Asn Pro Gly Tyr Asn Pro Tyr Val
150 155 160

gag tgc cag gac agc gtg cgc acc gtc agg gtc gtg gcc acc aag cag
Glu Cys Gln Asp Ser Val Arg Thr Val Arg Val Val Ala Thr Lys Gln
165 170 175 180

ggc aac gct gtg acc ctg gga gat tac tac cag ggc cgg agg att acc
Gly Asn Ala Val Thr Leu Gly Asp Tyr Tyr Gln Gly Arg Arg Ile Thr
185 190 195

atc acc gga aat gct gac ctg acc ttt gac cag acg gcg tgg ggg gac
Ile Thr Gly Asn Ala Asp Leu Thr Phe Asp Gln Thr Ala Trp Gly Asp
200 205 210

agt ggt gtg tat tac tgc tcc gtg gtc tca gcc cag gac ctc cag ggg
Ser Gly Val Tyr Tyr Cys Ser Val Val Ser Ala Gln Asp Leu Gln Gly
215 220 225

aac aat gag gcc tac gca gag ctc atc gtc ctt gtg tat gcc gcc ggc
Asn Asn Glu Ala Tyr Ala Glu Leu Ile Val Leu Val Tyr Ala Ala Gly
230 235 240

aaa gca gcc acc tca ggt gtt ccc agc att tat gcc ccc agc acc tat
Lys Ala Ala Thr Ser Gly Val Pro Ser Ile Tyr Ala Pro Ser Thr Tyr
245 250 255 260

gcc cac ctg tct ccc gcc aag acc cca ccc cca cca gct atg att ccc
Ala His Leu Ser Pro Ala Lys Thr Pro Pro Pro Pro Ala Met Ile Pro
265 270 275

atg ggc cct gcc tac aac ggg tac cct gga gga tac cct gga gac gtt
Met Gly Pro Ala Tyr Asn Gly Tyr Pro Gly Gly Tyr Pro Gly Asp Val
280 285 290

gac agg agt agc tca gct ggt ggc caa ggc tcc tat gta ccc ctg ctt
Asp Arg Ser Ser Ser Ala Gly Gly Gln Gly Ser Tyr Val Pro Leu Leu
295 300 305

cgg gac acg gac agc agt gtg gcc tct gaa gtc cgc agt ggc tac agg
Arg Asp Thr Asp Ser Ser Val Ala Ser Glu Val Arg Ser Gly Tyr Arg
310 315 320

att cag gcc agc cag cag gac gac tcc atg cgg gtc ctg tac tac atg
Ile Gln Ala Ser Gln Gln Asp Asp Ser Met Arg Val Leu Tyr Tyr Met
325 330 335 340

gag aag gag ctg gcc aac ttc gac cct tct cga cct ggc ccc ccc agt
Glu Lys Glu Leu Ala Asn Phe Asp Pro Ser Arg Pro Gly Pro Pro Ser
345 350 355

ggc cgt gtg gag cgg gcc atg agt gaa gtc acc tcc ctc cac gag gac
Gly Arg Val Glu Arg Ala Met Ser Glu Val Thr Ser Leu His Glu Asp
360 365 370

gac tgg cga tct cgg cct tcc cgg ggc cct gcc ctc acc ccg atc cgg
Asp Trp Arg Ser Arg Pro Ser Arg Gly Pro Ala Leu Thr Pro Ile Arg
375 380 385

gat gag gag tgg ggt ggc cac tcc ccc cgg agt ccc agg gga tgg gac
Asp Glu Glu Trp Gly Gly His Ser Pro Arg Ser Pro Arg Gly Trp Asp
390 395 400

cag gag ccc gcc agg gag cag gca ggc ggg ggc tgg cgg gcc agg cgg
Gln Glu Pro Ala Arg Glu Gln Ala Gly Gly Gly Trp Arg Ala Arg Arg
405 410 415 420

ccc cgg gcc cgc tcc gtg gac gcc ctg gac gac ctc acc ccg ccg agc
Pro Arg Ala Arg Ser Val Asp Ala Leu Asp Asp Leu Thr Pro Pro Ser
425 430 435

acc gcc gag tca ggg agc agg tct ccc acg agt aat ggt ggg aga agc
Thr Ala Glu Ser Gly Ser Arg Ser Pro Thr Ser Asn Gly Gly Arg Ser
440 445 450

cgg gcc tac atg ccc ccg cgg agc cgc agc cgg gac gac ctc tat gac
Arg Ala Tyr Met Pro Pro Arg Ser Arg Ser Arg Asp Asp Leu Tyr Asp
455 460 465

caa gac gac tcg agg gac ttc cca cgc tcc cgg gac ccc cac tac gac
Gln Asp Asp Ser Arg Asp Phe Pro Arg Ser Arg Asp Pro His Tyr Asp
470 475 480

gac ttc agg tct cgg gag cgc cct cct gcc gac ccc agg tcc cac cac
Asp Phe Arg Ser Arg Glu Arg Pro Pro Ala Asp Pro Arg Ser His His
485 490 495 500

cac cgt acc cgg gac cct cgg gac aac ggc tcc agg tcc ggg gac ctc
His Arg Thr Arg Asp Pro Arg Asp Asn Gly Ser Arg Ser Gly Asp Leu
505 510 515

ccc tat gat ggg cgg cta ctg gag gag gct gtg agg aag aag ggg tcg
Pro Tyr Asp Gly Arg Leu Leu Glu Glu Ala Val Arg Lys Lys Gly Ser
520 525 530

gag gag agg agg aga ccc cac aag gag gag gag gaa gag gcc tac tac
Glu Glu Arg Arg Arg Pro His Lys Glu Glu Glu Glu Glu Ala Tyr Tyr
535 540 545

ccg ccc gcg ccg ccc ccg tac tcg gag acc gac tcg cag gcg tcc cga
Pro Pro Ala Pro Pro Pro Tyr Ser Glu Thr Asp Ser Gln Ala Ser Arg
550 555 560

gag cgc agg ctc aag aag aac ttg gcc ctg agt cgg gaa agt tta gtc 1843
Glu Arg Arg Leu Lys Lys Asn Leu Ala Leu Ser Arg Glu Ser Leu Val
565 570 575 580

gtc tga tctgacgttt tctacgtagc ttttgtattt ttttttttaa tttgaaggaa 1899
Val *

cactgatgaa gccctgccat acccctcccg agtctaataa aacgtataat cacaa 1954



<210> 12 
<211> 581 
<212> PRT 

<213> Homo sapiens 



<400> 12 

Met Gln Gln Asp Gly Leu Gly Val Gly Thr Arg Asn Gly Ser Gly Lys
1 5 10 15

Gly Arg Ser Val His Pro Ser Trp Pro Trp Cys Ala Pro Arg Pro Leu
20 25 30

Arg Tyr Phe Gly Arg Asp Ala Arg Ala Arg Arg Ala Gln Thr Ala Ala
35 40 45

Met Ala Leu Leu Ala Gly Gly Leu Ser Arg Gly Leu Gly Ser His Pro
50 55 60

Ala Ala Ala Gly Arg Asp Ala Val Val Phe Val Trp Leu Leu Leu Ser
65 70 75 80

Thr Trp Cys Thr Ala Pro Ala Arg Ala Ile Gln Val Thr Val Ser Asn
85 90 95

Pro Tyr His Val Val Ile Leu Phe Gln Pro Val Thr Leu Pro Cys Thr
100 105 110

Tyr Gln Met Thr Ser Thr Pro Thr Gln Pro Ile Val Ile Trp Lys Tyr
115 120 125

Lys Ser Phe Cys Arg Asp Arg Ile Ala Asp Ala Phe Ser Pro Ala Ser
130 135 140

Val Asp Asn Gln Leu Asn Ala Gln Leu Ala Ala Gly Asn Pro Gly Tyr
145 150 155 160

Asn Pro Tyr Val Glu Cys Gln Asp Ser Val Arg Thr Val Arg Val Val
165 170 175

Ala Thr Lys Gln Gly Asn Ala Val Thr Leu Gly Asp Tyr Tyr Gln Gly
180 185 190

Arg Arg Ile Thr Ile Thr Gly Asn Ala Asp Leu Thr Phe Asp Gln Thr
195 200 205

Ala Trp Gly Asp Ser Gly Val Tyr Tyr Cys Ser Val Val Ser Ala Gln
210 215 220

Asp Leu Gln Gly Asn Asn Glu Ala Tyr Ala Glu Leu Ile Val Leu Val
225 230 235 240

Tyr Ala Ala Gly Lys Ala Ala Thr Ser Gly Val Pro Ser Ile Tyr Ala
245 250 255

Pro Ser Thr Tyr Ala His Leu Ser Pro Ala Lys Thr Pro Pro Pro Pro
260 265 270

Ala Met Ile Pro Met Gly Pro Ala Tyr Asn Gly Tyr Pro Gly Gly Tyr
275 280 285

Pro Gly Asp Val Asp Arg Ser Ser Ser Ala Gly Gly Gln Gly Ser Tyr
290 295 300

Val Pro Leu Leu Arg Asp Thr Asp Ser Ser Val Ala Ser Glu Val Arg
305 310 315 320

Ser Gly Tyr Arg Ile Gln Ala Ser Gln Gln Asp Asp Ser Met Arg Val
325 330 335

Leu Tyr Tyr Met Glu Lys Glu Leu Ala Asn Phe Asp Pro Ser Arg Pro
340 345 350

Gly Pro Pro Ser Gly Arg Val Glu Arg Ala Met Ser Glu Val Thr Ser
355 360 365

Leu His Glu Asp Asp Trp Arg Ser Arg Pro Ser Arg Gly Pro Ala Leu
370 375 380

Thr Pro Ile Arg Asp Glu Glu Trp Gly Gly His Ser Pro Arg Ser Pro
385 390 395 400

Arg Gly Trp Asp Gln Glu Pro Ala Arg Glu Gln Ala Gly Gly Gly Trp
405 410 415

Arg Ala Arg Arg Pro Arg Ala Arg Ser Val Asp Ala Leu Asp Asp Leu
420 425 430

Thr Pro Pro Ser Thr Ala Glu Ser Gly Ser Arg Ser Pro Thr Ser Asn
435 440 445

Gly Gly Arg Ser Arg Ala Tyr Met Pro Pro Arg Ser Arg Ser Arg Asp
450 455 460

Asp Leu Tyr Asp Gln Asp Asp Ser Arg Asp Phe Pro Arg Ser Arg Asp
465 470 475 480

Pro His Tyr Asp Asp Phe Arg Ser Arg Glu Arg Pro Pro Ala Asp Pro
485 490 495

Arg Ser His His His Arg Thr Arg Asp Pro Arg Asp Asn Gly Ser Arg
500 505 510

Ser Gly Asp Leu Pro Tyr Asp Gly Arg Leu Leu Glu Glu Ala Val Arg
515 520 525

Lys Lys Gly Ser Glu Glu Arg Arg Arg Pro His Lys Glu Glu Glu Glu
530 535 540

Glu Ala Tyr Tyr Pro Pro Ala Pro Pro Pro Tyr Ser Glu Thr Asp Ser
545 550 555 560

Gln Ala Ser Arg Glu Arg Arg Leu Lys Lys Asn Leu Ala Leu Ser Arg
565 570 575

Glu Ser Leu Val Val
580



<210> 13 
<211> 1886 
<212> DNA 

<213> Mus musculus 
<400> 13 

gcaccgtcgc tgctagacgg ccgcg atg gcg ccg gcg gcc agc gcg tgt gct 52
Met Ala Pro Ala Ala Ser Ala Cys Ala
1 5

ggg gcg cct ggc tcc cac ccg gcc acc acg atc ttc gtg tgt ctt ttt 100
Gly Ala Pro Gly Ser His Pro Ala Thr Thr Ile Phe Val Cys Leu Phe
10 15 20 25

ctc atc att tac tgc cca gac cgt gcc agt gcc atc cag gtg acc gtg 148
Leu Ile Ile Tyr Cys Pro Asp Arg Ala Ser Ala Ile Gln Val Thr Val
30 35 40

cct gac ccc tac cac gta gtg atc ctg ttc cag cca gtg aca cta cac 196
Pro Asp Pro Tyr His Val Val Ile Leu Phe Gln Pro Val Thr Leu His
45 50 55

tgc acc tac cag atg agc aat acc ctc aca gcc cct atc gtg atc tgg 244
Cys Thr Tyr Gln Met Ser Asn Thr Leu Thr Ala Pro Ile Val Ile Trp
60 65 70

aag tat aag tcg ttc tgt cgg gac cgt gtt gcc gac gcc ttc tcc cct 292
Lys Tyr Lys Ser Phe Cys Arg Asp Arg Val Ala Asp Ala Phe Ser Pro
75 80 85

gcc agc gtg gac aac cag ctc aac gcc cag ctg gcg gct ggc aac ccc 340
Ala Ser Val Asp Asn Gln Leu Asn Ala Gln Leu Ala Ala Gly Asn Pro
90 95 100 105

ggc tac aac ccc tat gtg gag tgc cag gac agc gta cgc act gtc agg 388
Gly Tyr Asn Pro Tyr Val Glu Cys Gln Asp Ser Val Arg Thr Val Arg
110 115 120

gtg gtg gcc acc aaa cag ggc aat gct gtg acc ctg gga gac tac tac 436
Val Val Ala Thr Lys Gln Gly Asn Ala Val Thr Leu Gly Asp Tyr Tyr
125 130 135

cag ggc agg aga atc acc atc aca gga aat gct ggc ctg acc ttc gag 484
Gln Gly Arg Arg Ile Thr Ile Thr Gly Asn Ala Gly Leu Thr Phe Glu
140 145 150

cag acg gcc tgg gga gac agt gga gtg tat tac tgc tcc gtg gtc tca 532
Gln Thr Ala Trp Gly Asp Ser Gly Val Tyr Tyr Cys Ser Val Val Ser
155 160 165

gcc caa gat ctg gat ggg aac aac gag gcg tac gca gag ctc att gtc 580
Ala Gln Asp Leu Asp Gly Asn Asn Glu Ala Tyr Ala Glu Leu Ile Val
170 175 180 185

ctt ggc agg acc tca gaa gcc cct gag ctc cta cct ggt ttt cgg gcg 628
Leu Gly Arg Thr Ser Glu Ala Pro Glu Leu Leu Pro Gly Phe Arg Ala
190 195 200

ggg ccc ttg gaa gat tgg ctc ttt gtg gtc gtg gtc tgc ctg gca agc 676
Gly Pro Leu Glu Asp Trp Leu Phe Val Val Val Val Cys Leu Ala Ser
205 210 215

ctc ctc ttc ttc ctc ctc ctg ggc atc tgc tgg tgc cag tgc tgt ccc 724
Leu Leu Phe Phe Leu Leu Leu Gly Ile Cys Trp Cys Gln Cys Cys Pro
220 225 230

cac acc tgc tgc tgc tat gtc aga tgt ccc tgc tgc cca gac aag tgc 772
His Thr Cys Cys Cys Tyr Val Arg Cys Pro Cys Cys Pro Asp Lys Cys
235 240 245

tgt tgc cct gag gcc ctt tat gct gct ggc aaa gca gcc acc tca ggt 820
Cys Cys Pro Glu Ala Leu Tyr Ala Ala Gly Lys Ala Ala Thr Ser Gly
250 255 260 265

gtg cca agc atc tat gcc ccc agc atc tat acc cac ctc tct cct gcc 868
Val Pro Ser Ile Tyr Ala Pro Ser Ile Tyr Thr His Leu Ser Pro Ala
270 275 280

aag act ccg cca cct ccg cct gcc atg att ccc atg cgt cct ccc tat 916
Lys Thr Pro Pro Pro Pro Pro Ala Met Ile Pro Met Arg Pro Pro Tyr
285 290 295

ggg tac cct gga gac ttt gac agg acc agc tca gtt ggt ggc cac agc 964
Gly Tyr Pro Gly Asp Phe Asp Arg Thr Ser Ser Val Gly Gly His Ser
300 305 310

tcc cag gtg ccc ctg ctg cgt gaa gtg gat ggg agc gta tct tca gaa 1012
Ser Gln Val Pro Leu Leu Arg Glu Val Asp Gly Ser Val Ser Ser Glu
315 320 325

gta cga agt ggc tac agg atc cag gct aac cag caa gat gac tcc atg 1060
Val Arg Ser Gly Tyr Arg Ile Gln Ala Asn Gln Gln Asp Asp Ser Met
330 335 340 345

agg gtc cta tac tat atg gag aag gag cta gcc aac ttc gat cct tcc 1108
Arg Val Leu Tyr Tyr Met Glu Lys Glu Leu Ala Asn Phe Asp Pro Ser
350 355 360

cgg cct ggc cct ccc aat ggc cga gtg gaa cgg gcc atg agt gaa gta 1156
Arg Pro Gly Pro Pro Asn Gly Arg Val Glu Arg Ala Met Ser Glu Val
365 370 375

acc tcc ctc cat gaa gat gac tgg cga tct cgg cct tcc agg gct cct 1204
Thr Ser Leu His Glu Asp Asp Trp Arg Ser Arg Pro Ser Arg Ala Pro
380 385 390

gcc ctc aca ccc atc agg gat gag gag tgg aat cgc cac tcc cct cgg 1252
Ala Leu Thr Pro Ile Arg Asp Glu Glu Trp Asn Arg His Ser Pro Arg
395 400 405

agt ccc aga aca tgg gag cag gaa ccc ctt caa gaa cag cca agg ggt 1300
Ser Pro Arg Thr Trp Glu Gln Glu Pro Leu Gln Glu Gln Pro Arg Gly
410 415 420 425

ggt tgg ggg tct ggg cgg cct cgg gcc cgc tct gtg gat gct cta gat 1348
Gly Trp Gly Ser Gly Arg Pro Arg Ala Arg Ser Val Asp Ala Leu Asp
430 435 440

gac atc aac cgg cct ggc tcc act gaa tca gga agg tct tct ccc cca 1396
Asp Ile Asn Arg Pro Gly Ser Thr Glu Ser Gly Arg Ser Ser Pro Pro
445 450 455

agt agt gga cgg aga ggg cgg gcc tat gca cct ccg aga agt cgc agc 1444
Ser Ser Gly Arg Arg Gly Arg Ala Tyr Ala Pro Pro Arg Ser Arg Ser
460 465 470

cgg gat gac ctc tat gac ccc gac gat cct aga gac ttg cca cat tcc 1492
Arg Asp Asp Leu Tyr Asp Pro Asp Asp Pro Arg Asp Leu Pro His Ser
475 480 485

cga gat ccc cac tat tat gat gat ttg agg tct agg gat cca cgt gct 1540
Arg Asp Pro His Tyr Tyr Asp Asp Leu Arg Ser Arg Asp Pro Arg Ala
490 495 500 505

gac ccc aga tcc cgt cag cga tcc cac gat cct cgg gat gct ggc ttc 1588
Asp Pro Arg Ser Arg Gln Arg Ser His Asp Pro Arg Asp Ala Gly Phe
510 515 520

agg tca cgg gac cct cag tat gat ggg cga ctc tta gaa gag gct tta 1636
Arg Ser Arg Asp Pro Gln Tyr Asp Gly Arg Leu Leu Glu Glu Ala Leu
525 530 535

aag aaa aaa ggg gct ggg gag aga aga cgc gtt tac agg gag gaa gaa 1684
Lys Lys Lys Gly Ala Gly Glu Arg Arg Arg Val Tyr Arg Glu Glu Glu
540 545 550

gaa gaa gaa gag gag ggc cac tat ccc cca gca cct ccg cct tac tct 1732
Glu Glu Glu Glu Glu Gly His Tyr Pro Pro Ala Pro Pro Pro Tyr Ser
555 560 565

gag act gac tcg cag gcc tcg agg gag cgg agg atg aaa aag aat ttg 1780
Glu Thr Asp Ser Gln Ala Ser Arg Glu Arg Arg Met Lys Lys Asn Leu
570 575 580 585

gcc ctg agt cgg gaa agt tta gtc gtc tga tcccacgttt tgttatgtag 1830
Ala Leu Ser Arg Glu Ser Leu Val Val *
590

cttttatact tttttaattg gaatattgat gaaactcttc accaagccta ataaaa 1886



<210> 14
<211> 1829
<212> DNA
<213> Mus musculus
<400> 14

gcaccgtcgc tgctagacgg ccgcg atg gcg ccg gcg gcc agc gcg tgt gct 52
Met Ala Pro Ala Ala Ser Ala Cys Ala
1 5

ggg gcg cct ggc tcc cac ccg gcc acc acg atc ttc gtg tgt ctt ttt 100
Gly Ala Pro Gly Ser His Pro Ala Thr Thr Ile Phe Val Cys Leu Phe
10 15 20 25

ctc atc att tac tgc cca gac cgt gcc agt gcc atc cag gtg acc gtg 148
Leu Ile Ile Tyr Cys Pro Asp Arg Ala Ser Ala Ile Gln Val Thr Val
30 35 40

cct gac ccc tac cac gta gtg atc ctg ttc cag cca gtg aca cta cac 196
Pro Asp Pro Tyr His Val Val Ile Leu Phe Gln Pro Val Thr Leu His
45 50 55

tgc acc tac cag atg agc aat acc ctc aca gcc cct atc gtg atc tgg 244
Cys Thr Tyr Gln Met Ser Asn Thr Leu Thr Ala Pro Ile Val Ile Trp
60 65 70

aag tat aag tcg ttc tgt cgg gac cgt gtt gcc gac gcc ttc tcc cct 292
Lys Tyr Lys Ser Phe Cys Arg Asp Arg Val Ala Asp Ala Phe Ser Pro
75 80 85

gcc agc gtg gac aac cag ctc aac gcc cag ctg gcg gct ggc aac ccc 340
Ala Ser Val Asp Asn Gln Leu Asn Ala Gln Leu Ala Ala Gly Asn Pro
90 95 100 105

ggc tac aac ccc tat gtg gag tgc cag gac agc gta cgc act gtc agg 388
Gly Tyr Asn Pro Tyr Val Glu Cys Gln Asp Ser Val Arg Thr Val Arg
110 115 120

gtg gtg gcc acc aaa cag ggc aat gct gtg acc ctg gga gac tac tac 436
Val Val Ala Thr Lys Gln Gly Asn Ala Val Thr Leu Gly Asp Tyr Tyr
125 130 135

cag ggc agg aga atc acc atc aca gga aat gct ggc ctg acc ttc gag 484
Gln Gly Arg Arg Ile Thr Ile Thr Gly Asn Ala Gly Leu Thr Phe Glu
140 145 150

cag acg gcc tgg gga gac agt gga gtg tat tac tgc tcc gtg gtc tca 532
Gln Thr Ala Trp Gly Asp Ser Gly Val Tyr Tyr Cys Ser Val Val Ser
155 160 165

gcc caa gat ctg gat ggg aac aac gag gcg tac gca gag ctc att gtc 580
Ala Gln Asp Leu Asp Gly Asn Asn Glu Ala Tyr Ala Glu Leu Ile Val
170 175 180 185

ctt gat tgg ctc ttt gtg gtc gtg gtc tgc ctg gca agc ctc ctc ttc 628
Leu Asp Trp Leu Phe Val Val Val Val Cys Leu Ala Ser Leu Leu Phe
190 195 200

ttc ctc ctc ctg ggc atc tgc tgg tgc cag tgc tgt ccc cac acc tgc 676
Phe Leu Leu Leu Gly Ile Cys Trp Cys Gln Cys Cys Pro His Thr Cys
205 210 215

tgc tgc tat gtc aga tgt ccc tgc tgc cca gac aag tgc tgt tgc cct 724
Cys Cys Tyr Val Arg Cys Pro Cys Cys Pro Asp Lys Cys Cys Cys Pro
220 225 230

gag gcc ctt tat gct gct ggc aaa gca gcc acc tca ggt gtg cca agc 772
Glu Ala Leu Tyr Ala Ala Gly Lys Ala Ala Thr Ser Gly Val Pro Ser
235 240 245

atc tat gcc ccc agc atc tat acc cac ctc tct cct gcc aag act ccg 820
Ile Tyr Ala Pro Ser Ile Tyr Thr His Leu Ser Pro Ala Lys Thr Pro
250 255 260 265

cca cct ccg cct gcc atg att ccc atg cgt cct ccc tat ggg tac cct 868
Pro Pro Pro Pro Ala Met Ile Pro Met Arg Pro Pro Tyr Gly Tyr Pro
270 275 280

gga gac ttt gac agg acc agc tca gtt ggt ggc cac agc tcc cag gtg 916
Gly Asp Phe Asp Arg Thr Ser Ser Val Gly Gly His Ser Ser Gln Val
285 290 295

ccc ctg ctg cgt gaa gtg gat ggg agc gta tct tca gaa gta cga agt 964
Pro Leu Leu Arg Glu Val Asp Gly Ser Val Ser Ser Glu Val Arg Ser
300 305 310

ggc tac agg atc cag gct aac cag caa gat gac tcc atg agg gtc cta 1012
Gly Tyr Arg Ile Gln Ala Asn Gln Gln Asp Asp Ser Met Arg Val Leu
315 320 325

tac tat atg gag aag gag cta gcc aac ttc gat cct tcc cgg cct ggc 1060
Tyr Tyr Met Glu Lys Glu Leu Ala Asn Phe Asp Pro Ser Arg Pro Gly
330 335 340 345

cct ccc aat ggc cga gtg gaa cgg gcc atg agt gaa gta acc tcc ctc 1108
Pro Pro Asn Gly Arg Val Glu Arg Ala Met Ser Glu Val Thr Ser Leu
350 355 360

cat gaa gat gac tgg cga tct cgg cct tcc agg gct cct gcc ctc aca 1156
His Glu Asp Asp Trp Arg Ser Arg Pro Ser Arg Ala Pro Ala Leu Thr
365 370 375



ccc ate agg gat gag gag tgg aat cgc cac tec cct egg agt ccc aga 



36 



1204 



t 



Pro He Arg Asp Glu Glu Trp Asn Arg His Ser Pro Arg Ser Pro Arg 
380 385 390 

aca tgg gag cag gaa ccc ctt caa gaa cag cca agg ggt ggt tgg ggg 1252 

Thr Trp Glu Gin Glu Pro Leu Gin Glu Gin Pro Arg Gly Gly Trp Gly 
395 400 405. 



tct ggg egg cct egg gec cgc tct gtg gat get eta gat gac ate aac 
Ser Gly Arg Pro Arg Ala Arg Ser Val Asp Ala Leu Asp Asp He Asn 
410 415 .420 425 

egg cct ggc tec act gaa tea gga agg tct tct ccc cca agt agt gga 
Arg Pro Gly Ser Thr Glu Ser Gly Arg Ser Ser Pro Pro Ser Ser Gly 

430 435 440 



etc tat gac ccc gac gat cct aga gac ttg cca cat tec cga gat ccc 
Leu Tyr Asp Pro Asp Asp Pro Arg, Asp Leu Pro His Ser Arg Asp Pro 
460 465 470 

cac tat tat gat gat ttg agg tct agg gat cca cgt get gac ccc aga 
His Tyr Tyr Asp Asp Leu Arg Ser Arg Asp Pro Arg Ala Asp Pro Arg 
475 480 485 1 

tec cgt cag cga tec cac gat cct egg gat get ggc ttc agg tea egg 
Ser Arg Gin Arg Ser His Asp Pro Arg Asp Ala Gly Phe Arg Ser Arg 
490 495 500 505 

gac cct cag tat gat ggg cga etc tta gaa gag get tta aag aaa aaa 
Asp Pro Gin Tyr Asp Gly Arg Leu Leu Glu Glu Ala Leu Lys Lys Lys 

510 515 520 



gag gag ggc cac tat ccc cca gca cct ccg cct tac tct gag act gac 
Glu Glu Gly His Tyr Pro Pro Ala Pro Pro Pro Tyr Ser Glu Thr Asp 
540 545 550 



tttttaattg gaatattgat gaaactcttc accaagccta ataaaa 



<210> 15 
<211> .1682 



1300 



1348 



egg aga ggg egg gec tat gca cct ccg aga agt cgc age egg gat gac 1396 
Arg Arg Gly Arg Ala Tyr Ala Pro Pro Arg Ser Arg Ser Arg Asp Asp 

445 450 455 



1444 



1492 



1540 



1588 



ggg get ggg gag aga aga cgc gtt tac agg gag gaa gaa gaa gaa gaa 1636 
Gly Ala Gly Glu Arg Arg Arg Val Tyr Arg Glu Glu Glu Glu Glu Glu 

525 530 535 



1684 



teg cag gee teg agg gag egg agg atg aaa aag aat ttg gec ctg agt 1732 

Ser Gin Ala Ser Arg Glu Arg Arg Met Lys Lys Asn Leu Ala Leu. Ser 
555 560 565 

egg gaa agt tta gtc gtc tga tcccacgttt tgttatgtag cttttatact 1783 

Arg Glu Ser Leu Val Val * 

570 575 



1829 



37 



<212> DNA 

<213> Mus musculus 



<400> 15 

gcaccgtcgc tgctagacgg ccgcg atg gcg ccg gcg gcc agc gcg tgt gct 52
Met Ala Pro Ala Ala Ser Ala Cys Ala
1 5

ggg gcg cct ggc tcc cac ccg gcc acc acg atc ttc gtg tgt ctt ttt 100
Gly Ala Pro Gly Ser His Pro Ala Thr Thr Ile Phe Val Cys Leu Phe
10 15 20 25

ctc atc att tac tgc cca gac cgt gcc agt gcc atc cag gtg acc gtg 148
Leu Ile Ile Tyr Cys Pro Asp Arg Ala Ser Ala Ile Gln Val Thr Val
30 35 40

cct gac ccc tac cac gta gtg atc ctg ttc cag cca gtg aca cta cac 196
Pro Asp Pro Tyr His Val Val Ile Leu Phe Gln Pro Val Thr Leu His
45 50 55

tgc acc tac cag atg agc aat acc ctc aca gcc cct atc gtg atc tgg 244
Cys Thr Tyr Gln Met Ser Asn Thr Leu Thr Ala Pro Ile Val Ile Trp
60 65 70

aag tat aag tcg ttc tgt cgg gac cgt gtt gcc gac gcc ttc tcc cct 292
Lys Tyr Lys Ser Phe Cys Arg Asp Arg Val Ala Asp Ala Phe Ser Pro
75 80 85

gcc agc gtg gac aac cag ctc aac gcc cag ctg gcg gct ggc aac ccc 340
Ala Ser Val Asp Asn Gln Leu Asn Ala Gln Leu Ala Ala Gly Asn Pro
90 95 100 105

ggc tac aac ccc tat gtg gag tgc cag gac agc gta cgc act gtc agg 388
Gly Tyr Asn Pro Tyr Val Glu Cys Gln Asp Ser Val Arg Thr Val Arg
110 115 120

gtg gtg gcc acc aaa cag ggc aat gct gtg acc ctg gga gac tac tac 436
Val Val Ala Thr Lys Gln Gly Asn Ala Val Thr Leu Gly Asp Tyr Tyr
125 130 135

cag ggc agg aga atc acc atc aca gga aat gct ggc ctg acc ttc gag 484
Gln Gly Arg Arg Ile Thr Ile Thr Gly Asn Ala Gly Leu Thr Phe Glu
140 145 150

cag acg gcc tgg gga gac agt gga gtg tat tac tgc tcc gtg gtc tca 532
Gln Thr Ala Trp Gly Asp Ser Gly Val Tyr Tyr Cys Ser Val Val Ser
155 160 165

gcc caa gat ctg gat ggg aac aac gag gcg tac gca gag ctc att gtc 580
Ala Gln Asp Leu Asp Gly Asn Asn Glu Ala Tyr Ala Glu Leu Ile Val
170 175 180 185

ctt gtt tat gct gct ggc aaa gca gcc acc tca ggt gtg cca agc atc 628
Leu Val Tyr Ala Ala Gly Lys Ala Ala Thr Ser Gly Val Pro Ser Ile
190 195 200

tat gcc ccc agc atc tat acc cac ctc tct cct gcc aag act ccg cca 676
Tyr Ala Pro Ser Ile Tyr Thr His Leu Ser Pro Ala Lys Thr Pro Pro
205 210 215

cct ccg cct gcc atg att ccc atg cgt cct ccc tat ggg tac cct gga 724
Pro Pro Pro Ala Met Ile Pro Met Arg Pro Pro Tyr Gly Tyr Pro Gly
220 225 230

gac ttt gac agg acc agc tca gtt ggt ggc cac agc tcc cag gtg ccc 772
Asp Phe Asp Arg Thr Ser Ser Val Gly Gly His Ser Ser Gln Val Pro
235 240 245

ctg ctg cgt gaa gtg gat ggg agc gta tct tca gaa gta cga agt ggc 820
Leu Leu Arg Glu Val Asp Gly Ser Val Ser Ser Glu Val Arg Ser Gly
250 255 260 265

tac agg atc cag gct aac cag caa gat gac tcc atg agg gtc cta tac 868
Tyr Arg Ile Gln Ala Asn Gln Gln Asp Asp Ser Met Arg Val Leu Tyr
270 275 280

tat atg gag aag gag cta gcc aac ttc gat cct tcc cgg cct ggc cct 916
Tyr Met Glu Lys Glu Leu Ala Asn Phe Asp Pro Ser Arg Pro Gly Pro
285 290 295

ccc aat ggc cga gtg gaa cgg gcc atg agt gaa gta acc tcc ctc cat 964
Pro Asn Gly Arg Val Glu Arg Ala Met Ser Glu Val Thr Ser Leu His
300 305 310

gaa gat gac tgg cga tct cgg cct tcc agg gct cct gcc ctc aca ccc 1012
Glu Asp Asp Trp Arg Ser Arg Pro Ser Arg Ala Pro Ala Leu Thr Pro
315 320 325

atc agg gat gag gag tgg aat cgc cac tcc cct cgg agt ccc aga aca 1060
Ile Arg Asp Glu Glu Trp Asn Arg His Ser Pro Arg Ser Pro Arg Thr
330 335 340 345

tgg gag cag gaa ccc ctt caa gaa cag cca agg ggt ggt tgg ggg tct 1108
Trp Glu Gln Glu Pro Leu Gln Glu Gln Pro Arg Gly Gly Trp Gly Ser
350 355 360

ggg cgg cct cgg gcc cgc tct gtg gat gct cta gat gac atc aac cgg 1156
Gly Arg Pro Arg Ala Arg Ser Val Asp Ala Leu Asp Asp Ile Asn Arg
365 370 375

cct ggc tcc act gaa tca gga agg tct tct ccc cca agt agt gga cgg 1204
Pro Gly Ser Thr Glu Ser Gly Arg Ser Ser Pro Pro Ser Ser Gly Arg
380 385 390

aga ggg cgg gcc tat gca cct ccg aga agt cgc agc cgg gat gac ctc 1252
Arg Gly Arg Ala Tyr Ala Pro Pro Arg Ser Arg Ser Arg Asp Asp Leu
395 400 405

tat gac ccc gac gat cct aga gac ttg cca cat tcc cga gat ccc cac 1300
Tyr Asp Pro Asp Asp Pro Arg Asp Leu Pro His Ser Arg Asp Pro His
410 415 420 425

tat tat gat gat ttg agg tct agg gat cca cgt gct gac ccc aga tcc 1348
Tyr Tyr Asp Asp Leu Arg Ser Arg Asp Pro Arg Ala Asp Pro Arg Ser
430 435 440

cgt cag cga tcc cac gat cct cgg gat gct ggc ttc agg tca cgg gac 1396
Arg Gln Arg Ser His Asp Pro Arg Asp Ala Gly Phe Arg Ser Arg Asp
445 450 455

cct cag tat gat ggg cga ctc tta gaa gag gct tta aag aaa aaa ggg 1444
Pro Gln Tyr Asp Gly Arg Leu Leu Glu Glu Ala Leu Lys Lys Lys Gly
460 465 470

gct ggg gag aga aga cgc gtt tac agg gag gaa gaa gaa gaa gaa gag 1492
Ala Gly Glu Arg Arg Arg Val Tyr Arg Glu Glu Glu Glu Glu Glu Glu
475 480 485

gag ggc cac tat ccc cca gca cct ccg cct tac tct gag act gac tcg 1540
Glu Gly His Tyr Pro Pro Ala Pro Pro Pro Tyr Ser Glu Thr Asp Ser
490 495 500 505

cag gcc tcg agg gag cgg agg atg aaa aag aat ttg gcc ctg agt cgg 1588
Gln Ala Ser Arg Glu Arg Arg Met Lys Lys Asn Leu Ala Leu Ser Arg
510 515 520

gaa agt tta gtc gtc tga tcccacgttt tgttatgtag cttttatact 1636
Glu Ser Leu Val Val *
525

tttttaattg gaatattgat gaaactcttc accaagccta ataaaa 1682

<210> 16
<211> 594
<212> PRT
<213> Mus musculus

<400> 16
Met Ala Pro Ala Ala Ser Ala Cys Ala Gly Ala Pro Gly Ser His Pro
1 5 10 15

Ala Thr Thr Ile Phe Val Cys Leu Phe Leu Ile Ile Tyr Cys Pro Asp
20 25 30

Arg Ala Ser Ala Ile Gln Val Thr Val Pro Asp Pro Tyr His Val Val
35 40 45

Ile Leu Phe Gln Pro Val Thr Leu His Cys Thr Tyr Gln Met Ser Asn
50 55 60

Thr Leu Thr Ala Pro Ile Val Ile Trp Lys Tyr Lys Ser Phe Cys Arg
65 70 75 80

Asp Arg Val Ala Asp Ala Phe Ser Pro Ala Ser Val Asp Asn Gln Leu
85 90 95

Asn Ala Gln Leu Ala Ala Gly Asn Pro Gly Tyr Asn Pro Tyr Val Glu
100 105 110

Cys Gln Asp Ser Val Arg Thr Val Arg Val Val Ala Thr Lys Gln Gly
115 120 125

Asn Ala Val Thr Leu Gly Asp Tyr Tyr Gln Gly Arg Arg Ile Thr Ile
130 135 140

Thr Gly Asn Ala Gly Leu Thr Phe Glu Gln Thr Ala Trp Gly Asp Ser
145 150 155 160

Gly Val Tyr Tyr Cys Ser Val Val Ser Ala Gln Asp Leu Asp Gly Asn
165 170 175

Asn Glu Ala Tyr Ala Glu Leu Ile Val Leu Gly Arg Thr Ser Glu Ala
180 185 190

Pro Glu Leu Leu Pro Gly Phe Arg Ala Gly Pro Leu Glu Asp Trp Leu
195 200 205

Phe Val Val Val Val Cys Leu Ala Ser Leu Leu Phe Phe Leu Leu Leu
210 215 220

Gly Ile Cys Trp Cys Gln Cys Cys Pro His Thr Cys Cys Cys Tyr Val
225 230 235 240

Arg Cys Pro Cys Cys Pro Asp Lys Cys Cys Cys Pro Glu Ala Leu Tyr
245 250 255

Ala Ala Gly Lys Ala Ala Thr Ser Gly Val Pro Ser Ile Tyr Ala Pro
260 265 270

Ser Ile Tyr Thr His Leu Ser Pro Ala Lys Thr Pro Pro Pro Pro Pro
275 280 285

Ala Met Ile Pro Met Arg Pro Pro Tyr Gly Tyr Pro Gly Asp Phe Asp
290 295 300

Arg Thr Ser Ser Val Gly Gly His Ser Ser Gln Val Pro Leu Leu Arg
305 310 315 320

Glu Val Asp Gly Ser Val Ser Ser Glu Val Arg Ser Gly Tyr Arg Ile
325 330 335

Gln Ala Asn Gln Gln Asp Asp Ser Met Arg Val Leu Tyr Tyr Met Glu
340 345 350

Lys Glu Leu Ala Asn Phe Asp Pro Ser Arg Pro Gly Pro Pro Asn Gly
355 360 365

Arg Val Glu Arg Ala Met Ser Glu Val Thr Ser Leu His Glu Asp Asp
370 375 380

Trp Arg Ser Arg Pro Ser Arg Ala Pro Ala Leu Thr Pro Ile Arg Asp
385 390 395 400

Glu Glu Trp Asn Arg His Ser Pro Arg Ser Pro Arg Thr Trp Glu Gln
405 410 415

Glu Pro Leu Gln Glu Gln Pro Arg Gly Gly Trp Gly Ser Gly Arg Pro
420 425 430

Arg Ala Arg Ser Val Asp Ala Leu Asp Asp Ile Asn Arg Pro Gly Ser
435 440 445

Thr Glu Ser Gly Arg Ser Ser Pro Pro Ser Ser Gly Arg Arg Gly Arg
450 455 460

Ala Tyr Ala Pro Pro Arg Ser Arg Ser Arg Asp Asp Leu Tyr Asp Pro
465 470 475 480

Asp Asp Pro Arg Asp Leu Pro His Ser Arg Asp Pro His Tyr Tyr Asp
485 490 495

Asp Leu Arg Ser Arg Asp Pro Arg Ala Asp Pro Arg Ser Arg Gln Arg
500 505 510

Ser His Asp Pro Arg Asp Ala Gly Phe Arg Ser Arg Asp Pro Gln Tyr
515 520 525

Asp Gly Arg Leu Leu Glu Glu Ala Leu Lys Lys Lys Gly Ala Gly Glu
530 535 540

Arg Arg Arg Val Tyr Arg Glu Glu Glu Glu Glu Glu Glu Glu Gly His
545 550 555 560

Tyr Pro Pro Ala Pro Pro Pro Tyr Ser Glu Thr Asp Ser Gln Ala Ser
565 570 575

Arg Glu Arg Arg Met Lys Lys Asn Leu Ala Leu Ser Arg Glu Ser Leu
580 585 590

Val Val



<210> 17
<211> 575
<212> PRT
<213> Mus musculus

<400> 17
Met Ala Pro Ala Ala Ser Ala Cys Ala Gly Ala Pro Gly Ser His Pro
1 5 10 15

Ala Thr Thr Ile Phe Val Cys Leu Phe Leu Ile Ile Tyr Cys Pro Asp
20 25 30

Arg Ala Ser Ala Ile Gln Val Thr Val Pro Asp Pro Tyr His Val Val
35 40 45

Ile Leu Phe Gln Pro Val Thr Leu His Cys Thr Tyr Gln Met Ser Asn
50 55 60

Thr Leu Thr Ala Pro Ile Val Ile Trp Lys Tyr Lys Ser Phe Cys Arg
65 70 75 80

Asp Arg Val Ala Asp Ala Phe Ser Pro Ala Ser Val Asp Asn Gln Leu
85 90 95

Asn Ala Gln Leu Ala Ala Gly Asn Pro Gly Tyr Asn Pro Tyr Val Glu
100 105 110

Cys Gln Asp Ser Val Arg Thr Val Arg Val Val Ala Thr Lys Gln Gly
115 120 125

Asn Ala Val Thr Leu Gly Asp Tyr Tyr Gln Gly Arg Arg Ile Thr Ile
130 135 140

Thr Gly Asn Ala Gly Leu Thr Phe Glu Gln Thr Ala Trp Gly Asp Ser
145 150 155 160

Gly Val Tyr Tyr Cys Ser Val Val Ser Ala Gln Asp Leu Asp Gly Asn
165 170 175

Asn Glu Ala Tyr Ala Glu Leu Ile Val Leu Asp Trp Leu Phe Val Val
180 185 190

Val Val Cys Leu Ala Ser Leu Leu Phe Phe Leu Leu Leu Gly Ile Cys
195 200 205

Trp Cys Gln Cys Cys Pro His Thr Cys Cys Cys Tyr Val Arg Cys Pro
210 215 220

Cys Cys Pro Asp Lys Cys Cys Cys Pro Glu Ala Leu Tyr Ala Ala Gly
225 230 235 240

Lys Ala Ala Thr Ser Gly Val Pro Ser Ile Tyr Ala Pro Ser Ile Tyr
245 250 255

Thr His Leu Ser Pro Ala Lys Thr Pro Pro Pro Pro Pro Ala Met Ile
260 265 270

Pro Met Arg Pro Pro Tyr Gly Tyr Pro Gly Asp Phe Asp Arg Thr Ser
275 280 285

Ser Val Gly Gly His Ser Ser Gln Val Pro Leu Leu Arg Glu Val Asp
290 295 300

Gly Ser Val Ser Ser Glu Val Arg Ser Gly Tyr Arg Ile Gln Ala Asn
305 310 315 320

Gln Gln Asp Asp Ser Met Arg Val Leu Tyr Tyr Met Glu Lys Glu Leu
325 330 335

Ala Asn Phe Asp Pro Ser Arg Pro Gly Pro Pro Asn Gly Arg Val Glu
340 345 350

Arg Ala Met Ser Glu Val Thr Ser Leu His Glu Asp Asp Trp Arg Ser
355 360 365

Arg Pro Ser Arg Ala Pro Ala Leu Thr Pro Ile Arg Asp Glu Glu Trp
370 375 380

Asn Arg His Ser Pro Arg Ser Pro Arg Thr Trp Glu Gln Glu Pro Leu
385 390 395 400

Gln Glu Gln Pro Arg Gly Gly Trp Gly Ser Gly Arg Pro Arg Ala Arg
405 410 415

Ser Val Asp Ala Leu Asp Asp Ile Asn Arg Pro Gly Ser Thr Glu Ser
420 425 430

Gly Arg Ser Ser Pro Pro Ser Ser Gly Arg Arg Gly Arg Ala Tyr Ala
435 440 445

Pro Pro Arg Ser Arg Ser Arg Asp Asp Leu Tyr Asp Pro Asp Asp Pro
450 455 460

Arg Asp Leu Pro His Ser Arg Asp Pro His Tyr Tyr Asp Asp Leu Arg
465 470 475 480

Ser Arg Asp Pro Arg Ala Asp Pro Arg Ser Arg Gln Arg Ser His Asp
485 490 495

Pro Arg Asp Ala Gly Phe Arg Ser Arg Asp Pro Gln Tyr Asp Gly Arg
500 505 510

Leu Leu Glu Glu Ala Leu Lys Lys Lys Gly Ala Gly Glu Arg Arg Arg
515 520 525

Val Tyr Arg Glu Glu Glu Glu Glu Glu Glu Glu Gly His Tyr Pro Pro
530 535 540

Ala Pro Pro Pro Tyr Ser Glu Thr Asp Ser Gln Ala Ser Arg Glu Arg
545 550 555 560

Arg Met Lys Lys Asn Leu Ala Leu Ser Arg Glu Ser Leu Val Val
565 570 575



<210> 18
<211> 526
<212> PRT
<213> Mus musculus

<400> 18
Met Ala Pro Ala Ala Ser Ala Cys Ala Gly Ala Pro Gly Ser His Pro
1 5 10 15

Ala Thr Thr Ile Phe Val Cys Leu Phe Leu Ile Ile Tyr Cys Pro Asp
20 25 30

Arg Ala Ser Ala Ile Gln Val Thr Val Pro Asp Pro Tyr His Val Val
35 40 45

Ile Leu Phe Gln Pro Val Thr Leu His Cys Thr Tyr Gln Met Ser Asn
50 55 60

Thr Leu Thr Ala Pro Ile Val Ile Trp Lys Tyr Lys Ser Phe Cys Arg
65 70 75 80

Asp Arg Val Ala Asp Ala Phe Ser Pro Ala Ser Val Asp Asn Gln Leu
85 90 95

Asn Ala Gln Leu Ala Ala Gly Asn Pro Gly Tyr Asn Pro Tyr Val Glu
100 105 110

Cys Gln Asp Ser Val Arg Thr Val Arg Val Val Ala Thr Lys Gln Gly
115 120 125

Asn Ala Val Thr Leu Gly Asp Tyr Tyr Gln Gly Arg Arg Ile Thr Ile
130 135 140

Thr Gly Asn Ala Gly Leu Thr Phe Glu Gln Thr Ala Trp Gly Asp Ser
145 150 155 160

Gly Val Tyr Tyr Cys Ser Val Val Ser Ala Gln Asp Leu Asp Gly Asn
165 170 175

Asn Glu Ala Tyr Ala Glu Leu Ile Val Leu Val Tyr Ala Ala Gly Lys
180 185 190

Ala Ala Thr Ser Gly Val Pro Ser Ile Tyr Ala Pro Ser Ile Tyr Thr
195 200 205

His Leu Ser Pro Ala Lys Thr Pro Pro Pro Pro Pro Ala Met Ile Pro
210 215 220

Met Arg Pro Pro Tyr Gly Tyr Pro Gly Asp Phe Asp Arg Thr Ser Ser
225 230 235 240

Val Gly Gly His Ser Ser Gln Val Pro Leu Leu Arg Glu Val Asp Gly
245 250 255

Ser Val Ser Ser Glu Val Arg Ser Gly Tyr Arg Ile Gln Ala Asn Gln
260 265 270

Gln Asp Asp Ser Met Arg Val Leu Tyr Tyr Met Glu Lys Glu Leu Ala
275 280 285

Asn Phe Asp Pro Ser Arg Pro Gly Pro Pro Asn Gly Arg Val Glu Arg
290 295 300

Ala Met Ser Glu Val Thr Ser Leu His Glu Asp Asp Trp Arg Ser Arg
305 310 315 320

Pro Ser Arg Ala Pro Ala Leu Thr Pro Ile Arg Asp Glu Glu Trp Asn
325 330 335

Arg His Ser Pro Arg Ser Pro Arg Thr Trp Glu Gln Glu Pro Leu Gln
340 345 350

Glu Gln Pro Arg Gly Gly Trp Gly Ser Gly Arg Pro Arg Ala Arg Ser
355 360 365

Val Asp Ala Leu Asp Asp Ile Asn Arg Pro Gly Ser Thr Glu Ser Gly
370 375 380

Arg Ser Ser Pro Pro Ser Ser Gly Arg Arg Gly Arg Ala Tyr Ala Pro
385 390 395 400

Pro Arg Ser Arg Ser Arg Asp Asp Leu Tyr Asp Pro Asp Asp Pro Arg
405 410 415

Asp Leu Pro His Ser Arg Asp Pro His Tyr Tyr Asp Asp Leu Arg Ser
420 425 430

Arg Asp Pro Arg Ala Asp Pro Arg Ser Arg Gln Arg Ser His Asp Pro
435 440 445

Arg Asp Ala Gly Phe Arg Ser Arg Asp Pro Gln Tyr Asp Gly Arg Leu
450 455 460

Leu Glu Glu Ala Leu Lys Lys Lys Gly Ala Gly Glu Arg Arg Arg Val
465 470 475 480

Tyr Arg Glu Glu Glu Glu Glu Glu Glu Glu Gly His Tyr Pro Pro Ala
485 490 495

Pro Pro Pro Tyr Ser Glu Thr Asp Ser Gln Ala Ser Arg Glu Arg Arg
500 505 510

Met Lys Lys Asn Leu Ala Leu Ser Arg Glu Ser Leu Val Val
515 520 525



<210> 19
<211> 22976
<212> DNA
<213> Homo sapiens

<220>
<221> exon
<222> 1898..2253

<221> exon
<222> 3437..3781

<221> exon
<222> 12065..12184

<221> exon
<222> 15045..15101

<221> exon
<222> 15666..15812

<221> exon
<222> 19479..19652

<221> exon
<222> 19799..19858

<221> exon
<222> 19956..20087

<221> exon
<222> 20229..20854

<221> exon
<222> 20944..21094

<221> Misc_Feature
<222> 19956..19958
<223> Potential variant splicing site AAG

<400> 19
aacagtttgg cagttcctca aaaggttaaa aatagaacta ccaagtcacc cagcaattcc 60

attcttaggc atatattcaa aagaaatgaa agcagatatt tgtacaccag tgttcacagc 120

tgcactattt acaatagtca aaaggtagaa acaacctagg tccatccaca aatgaatgga 180

taaataaaac gtagcatata catacaatgg tacactagtc cgctgtaaaa agaaattttg 240

atcttactgc atgctacatg gcttcgacat actacaacat ggatggacct tgaaaacatt 300

attctttgtg aaataaacta gacacaggac aaatgttaga cgattccact tatatgaggc 360

acctagaatg ggcaatttgg taagcaaagt agaatagaaa ttactagggg cacaggtagc 420

agggaatggg gagttactgt ttaatggtca cagagtttat gttggggatg atgaaacagt 480

ttcggggata aagagtggtg attggtacac gacattgtga atatacttaa tgccactgaa 540

ttttacactt gaagtggtta aagcgataaa tattatagtt tgcatatttt atcataaaaa 600

tattttttta aacgatgaag ggacgtgaac gggttgaaat tttataaaaa gtggccaggg 660

aaggtgtcac tgcaatggtg tcctacagga ggaggaagat catgtggaca tctgcgggaa 720

gggtgttctg gcagagggag tagcacgggc gatggctctg aggactgtga gaagtatagt 780

tggaaacagc gaggaggcca gggtgtccga agctgagtaa gccagagaga gtgggaggag 840

gtgagataag agggggaagg tcagtttctg ctgagagtga ggaggagcca caggagggct 900

gtgagcaggt ggacgtgatc tggcttgagt tttaacaggg ccagtagaac aaagcacgcc 960

tgggtaccga aaccagccac tggccagttg gcaacctggg ggagtctaac gcgaggaagc 1020

gcccagggtt cccccaggat gcgctttccc tcgccgccac ctggagacag cagagtcacg 1080

cccagcgctg cgcaggctga tcgccgcgcc gcgcccccgc cctcggtcgc aggtggctcg 1140

ttccgggaat tcctaagcgg aaaccggtcc caagccccgc gccttcgctc ggccccttta 1200

agagccagaa tttccggagg gctgacccgg gggctaggga tgcccagggg ccgaaccaca 1260

agttgggaac gggtggggga ggtggcgaaa acttccgaag tggaattcca acttttcctg 1320

gccctgattc cccttgggca tccctgaggg ggcagagctt cccttccggg gactttagag 1380

ggttcctcag gtcatctaac tgggagacac aggaggcccg aagcgccccc cctccacccg 1440

gtccggagga accccagtgg aagtggagaa gtcaggcgcc accaacaagc ctctcccagc 1500

caggactttg cttagactcg ctcctcccgg cagggcgcac ctaggcgggt ccatcgccag 1560

ccggggagag gggtttgggc agggagggaa caggtgcgcg gcgggacccg ccctatctca 1620

acaggtgaat cgctccaagt gggtctcggt tgcatggatc tcggtgcgct tggtttggcc 1680

ggagcagatg ggggccggaa gggacctgtg gtccgcaggc gccctcccag cgggccagtc 1740

acttggttcg ggccctgggg gacggagcgc acctgggtca gcccacttcc ggggagggag 1800

gcagaggaac ccctccccgc cgctcacccc taagcccagc cctcggctcc cacccttgtg 1860

tacctgggcc gaaccattca ccggagcgcg cagcgggtgg agtgtggctc ggaggaccgc 1920

ggcgggtcaa gcacctttct cccccatatc tgaaagcatg ccctttgtcc acgtcgttta 1980

cgctcattaa aacttccaga atgcaacagg acggacttgg agtagggaca aggaacggaa 2040

gtgggaaggg gaggagcgtg cacccctcct ggccttggtg cgcgccgcgc cccctaaggt 2100

actttggaag ggacgcgcgg gccagacgcg cccagacggc cgcgatggcg ctgttggccg 2160

gcgggctctc cagagggctg ggctcccacc cggccgccgc aggccgggac gcggtcgtct 2220

tcgtgtggct tctgcttagc acctggtgca caggtacggg gcacggggcc tctgacgctg 2280

cggaacgccg gagggaactg tagaggggga tggatggagt tggaggcggc gggaagcggg 2340

aagcgggggt ctcagaggct gggaccttcc gatcccctgg gtcttgggcg atctgttgcg 2400

cgcgggagtg agaggaattc cccatttgtg ccggggagcg ctccccgcgc ccttatctgg 2460

aagatagcag gaagtgaaac tccctggacg gtgagacccg gagcggcagg gagaatggaa 2520

ctctttgtgg ggagggagtg gaagaccgcc cgatctctgg gaaaagaaaa gccgggatgg 2580

gacttgggcg cacccgggga tttctaagtt ttggagtaac cacaQcragqg 2640

ctggatcaga cgcttcctag agggacagag acgaaggaac aatgcctagg cctcgggtgg 2700

gtgtgggact ggggactccc catcccccgc accccaccca cctcccgcgg gctccggatt 2760


atacgtgcgt aagagtctgg tgggatggat ttacggactt gaaaccgact tctgctggca 2820

ggctttcacc tggatgggat atttgggtgg tgatgaggtc tttcccgaga cacttttggt 2880

tcagtcattt gaaatgactt tagagtaggg tgaggtggtg ggaggctgat ggagatattg 2940

tgggggcttt agtccctcca tggcaaagca gttcaggcaa acaactccat ggttttccct 3000

ccaaattcaa aaggccccgg gtaacctgga atccttcgta gtcggttttg aagtggggcc 3060

ttgggcgctg ggggcatcaa catggccatc tgggcttgcc tgcccaggcc acacagaggc 3120

cccttgttgt gggtgaatgg caaagggaag aggggactgg tgtggttcag aggccacagg 3180

ctgggaagag ggatggcggg cgagtccaag gaaactggcc gtgtcaccgt gcacctgcca 3240

cttcagcccc acgggtctat aaaatgggca tgattatcgt ggctacctca ctggtcctgg 3300

caattaagga acaatgtgtg ccaggcactc tgtaaaccac atacttgcga gtgtcaagct 3360

ggtgacaggt ggcgttcctg ttgaagcacc tccctgagct cacagcaacc cttgctgtct 3420

ctcctcttgc cctcagctcc tgccagggcc atccaggtga ccgtgtccaa cccctaccac 3480

gtggtgatcc tcttccagcc tgtgaccctg ccctgtacct accagatgac ctcgaccccc 3540

acgcaaccca tcgtcatctg gaagtacaag tctttctgcc gggaccgcat cgccgatgcc 3600

ttctccccgg ccagcgtcga caaccagctc aatgcccagc tggcagccgg gaacccaggc 3660

tacaacccct acgttgagtg ccaggacagc gtgcgcaccg tcagggtcgt ggccaccaag 3720

cagggcaacg ctgtgaccct gggagattac taccagggcc ggaggattac catcaccgga 3780

agtatgttgg gcagggcagg gggatgaggc tgggcttgcc cgggtggtgg gactggcgtc 3840

cttgtgcggg acctggagtc cccatctgaa agctcttgag tgccagtgtc tgaaaggacc 3900

attgaaggga gcaattcttt tttttttttt ttttgaagat ggagtcttgc tctggactcc 3960

aggctggagt gcagtggtgc gatctcagct cactgcaacc tccacctccc aggttcaagc 4020

aattctcttg cctcagcctc ccgagtagct gggactccag gtgcgtgcca ccacgcccag 4080

ttaatttttg tatttttagt agagatgggg tttcaccatg ttggccaggc tggtctcaaa 4140

ctcctgacct caaatgatct gcccgccttg gcctcgcaaa gtgctgagag acaccatacc 4200

cagcctaaag ggagcgattc tattctacta ttcttccttc tgctaatcct tccattcttt 4260

aatttaataa cgaagatttt ttgagtacct gtcatatacc aggtgctgtt ctgggccctg 4320

ggaatacagc tgttaacaaa atcatcaaac cacttccctc gtggagccca cattgcagtg 4380

agagagacaa acacgacaca cactctcaag tccttgaaga taaagaaaac tgggtaacgg 4440


agagaagagg ccagggtttg ttctataatc attaataaca cgagcagtaa gaagtaaaat 4500

ttatctaagt aacaacttat aaagggtcta ctgtgtgcta agctctcatc caggttccca 4560

aggattaact cagaccacac agtaattgaa tagattctat cattgtcatc ttacagaggc 4620

ccagagagag aaagtgactt gcctagtgtc atagctggta acggggctgg gattctaact 4680

cagccacttt gggtctagtg gccaagctcc taatcccttt gcttgcctag ggtggtccgc 4740 

agaggactca cagaggagat ggcaggagtg aactgcaggg gcaagagagc ttaatggaga 4800

aagcctgtga catgccagga actgcacaca tattctccca ttgagtcctc tcctctaccc 4860 

tcctgacagc tgaggcacag agaggttacc ttgttcaaat gggtgcatag gaagtcaaag 4920

tctggagctg gggtttgaac ccaggcagcc ctgagaacct tgttcttttt ttttgagacg 4980

gagtctcgct ctgtcgccca ggctggagtg cagtggcggg atctcggctc actgcaactc 5040

cgcctcccgg gttcacgcca ttctcctgcc tcagcctccc aagtagctgg gactacaggc 5100 

gcccgccact acgcccggct aattttttgt atttttagta gagacggggt ttcaccgttt 5160 

tagccgggat ggtctcgatc tcctgacctc gtgatccgcc cgcctcggcc tcccaaagtg 5220 

ctgggattac aggcgtgagc caccgcgccc ggccccttgt tcttaactgt aatgctgcct 5280 

cctgatagga tgtgcctgtt gggactaagt aaggggcagt cattcattca ttcatttggt 5340

atttatcaag catcgactat gtgtcgttgg tgctggggat agaggtgatt gggatggctg 5400 

aagtttctgt cgtcaaggag atgacattct ggtggagtga gactggcagt aaataagcag 5460

ataaagaaag agtatgagaa tttcaaagtc tgggcacggt ggctcacgtc tgtaatctca 5520 

gcactttggg aggccaaggt gggtggatca cctgaggtca ggagttccag accagcctgg 5580 

ccaacatggt gaaaccccgt ctctactaaa aatacaaaga ttagccaggc atggtggcac 5640 

atgcctgtaa tcccagctac tcaggaggct gaggcatgag aatcgcttga acccaggagg 5700 

cagaggttgc agtgagctga gatcgcacca ctgtactgca gtctgggcga cagagtgaga 5760

ctctgtctca aaaaaaaaaa aaaaaaaaaa gactccgtca aggtataaga atgtcagaga 5820 

gtactaagtg ttgcaaagaa aataacacca ggctgggtgc attggctcat gcctgtaaat 5880 

ttcagcactt tgggaggcca aggcaggagg atcacttgag cctaggagtt tgagaccagc 5940 

ctggacaaca aaatgagacc ccatgtctac aaaaatttta aaaatttaaa aattagctgg 6000 

gcatggtggc atgtgcctgt ggtcccggct gctcaggagg ctgaggtggg aggattgctt 6060 

gggcttgaga ggtcaaggct tcagtgagtc atgatcgtgc cactgcattc cagcctgggt 6120 

gacagagtga gaccctgtct tgaaatgaaa agaaaatagg ctgggcgcag tggctcacac 6180 

ctgtaatccc agcactttgg gaggccgagg tgggtggatc acctgaggtc aggagatcga 6240

gaccagcctg gccaacatgg tgaaatccca tctctactaa aaatacaaaa tttagccggg 6300


cgtggtggtg ggcgcctgta atcccagcta ctcgggaggc tgaggcagga gaatcgcttg 6360

aacctgggag gcgaaggttg cggtgcgcca agattgcgcc actgcactct agcctgggaa 6420

acagtgagac tccgtcttaa aaaaaaaaga aaaaagaaaa tagcactggg tgatgtgcta 6480

catggaatga cttgggctgt gaatatgatt tgaggagggc ctgggcctgg gccttacaga 6540

acctagaagg cagagaggaa ggggaggggc agggtgccag ggatgaaggc tcacgtacct 6600

catgtcttag tgtgtgttca ctgtcttaaa caagaattta aagttgggca tggggcagag 6660

cggggaaggg agcatccctt tgcagacccc aagaagccag gaactggagc acattctgct 6720

agaggatcga tgggaagcag ggttccaggg gctgagccta tgtcagtcct gtttcagagg 6780

aggcaccagg cttgcttgcc ctgaatttct gtgggcagct cagccatgag catcctactg 6840

ttattgaggt cacagggctg cttaggcccc ctcctctcta acccagggat tgtgcctgcc 6900

tggaccaggc gtgactgcta agcttctgcc aggacaagcc aaatactgag ggtgcttcct 6960

ctgctggacg caaaagtcca ggatgacccc ccaggctctg tctcggggaa ggggccctgc 7020

atgctccagg ggcctcacag gcctgggtct ttcaaaccac ccccacctgg gcctgtgttt 7080

gatcaaggcc ctgagtgtaa acatccattg tgtgtgtcct ttcaggaaat cccatagcca 7140

taggagcttc ctctgtttca gctttgagga tggggaaaag tggactcccc gtggtgttcc 7200

tagggtcacc cactgtgctg gggtttttct gttgttgttg ttttttttct gttgcccagg 7260

ctggagtgca gtggtgcaat ctcagctcac tgcaacctct gcctcgcaag ttcaagtgat 7320

tctccgcctc agcctcctga gtagctggga ttacaggtgc acaccaccac acctggctaa 7380

tttttgtatc tttttggtag agatgggatt tcgccatgtt ggccaggctg gtctcaaact 7440

cctgacctca ggtgatctgc ctgccttggc ctcccaaagt tctgggatta cagatgtgag 7500

ccaccatgcc cggcctatcc tggtttcaaa agtgaaaata gtcctggata aggtagaagg 7560

ctgtccactc caggcatccc tccggtccgg tggctcattc cctgctttgt ccttccatgc 7620

tttgggtgat ggaccagcac ctggacagga ggccctgttc cacctcctcg ggctccttgg 7680

ggtccaagtg cccccacctc cagctgcact gcagcagaga gcccatggga cctctgaaat 7740

catgaaggtc acctttgcgg tgtataaaga aggaaccaga ggttggagat gtggaggagg 7800

cctggctgct gttcccactg gagacctggc atcttctccc cgacctaaaa caatgaaagc 7860


agtgctcagc ccggatgaga tcacggccag cccaagacca ggaacagggt acgccctgca 7920

ggaagaaggt gtgcccagac cttaggatgg atcaaaagaa gccggaaaac tatatttttt 7980

gtgagttttg aaaatgtcag acaggtcaaa caaaacacag tgaggtccag cctcggccta 8040

caagatgcca gatttcaacc cctggcctat atgatctgtt tgccatggca ggcggttcct 8100

gtccacctct tttgtttata gcagggacca gctcttgagc tccagtgttg aagaggcacg 8160

gtcagggtct gatctgaaga cactggtggc tcatgcctgt aatcccagca cttcaggagg 8220

ccgaggcagg aggattgctt gaggacagga gctgggagac cagcctgggc aacacagtga 8280

gacccagaga ctacaaaaaa ataaatttag cggggcatga tggcacaccc tgctactctg 8340

gagatgggaa gattgcttga gcctaggagt tcgaagctgc agtgacccat gatcgcacca 8400

ctgcactcca gcctgggcga ccaagctagg ccctctcaaa aaagatacag gtggaaaaat 8460

gatggacgaa gagggcattg tggcaaacct ggggatttag gagaacctag tttggaattc 8520

tatgaggatt caatgaaaga atgtgtgtag aggggcccag cacatagtaa gagctcaata 8580

aacggtgggg gctaggggcg gtggctcatg cctgtaatcc cagcactttg ggaggctgag 8640

gcaggtggat cacttgagcc ctggagttca agatcaacct ggacaacaaa gcaagatccc 8700

atctcaaaat taaaaaacaa caccaacaac aaaaaaacag tggcttagat gcctgatcat 8760

tagggtaagt cgtgtcctca accccttcac atctgctctg aaggtcacca tatccggaag 8820

ccttccctgg cctccttgtt taaaatggca cagcccccac tccacgcctg gcactctctg 8880

ctgtccctga ttcgttttct ccatacagct tatctttgtc tgatatgtga catagttaac 8940

attttatatt tgtctttctt tcctagttag aatctgaact ctagaagggc aagggcaagg 9000

atttataact caaaggttcc gggcttaggc ctcttttata ttcttgattt tgaggttaat 9060

taagagctca ggcctagcga ggtggctcat gcctggaatc ccagcacttt gggaggccca 9120

ggcgggcaga tcacttgagg tcaggagttc cagacctgcc tggccaacac agtgaaaaac 9180

ctgtctctac taaaaataca aaaattagcc agttatgttg gcaggcgcct ataatcccag 9240

ctactcaaga ggctgaggca ggagaatcgc ttgaacccag gaggcagagg ctgcagtgag 9300

ccaagatcgt gccactgcac tccagcctgg gcaacagagc gagactccat ctcaaaaaaa 9360

aaaaaaaaat taagagctca aagagtttgt tttcataggc agcagaatga gaaaagttta 9420

caaaatacrtt taaatgacaa taaagtcatt atagattaac ataaataaaa taccttttat 9480

gaaaaaaata atcattttct gaaatcagac aaaacattgt gaatgagaag gtggcatggt 9540

tttatttttt tgcaagtctc cgaagcctgg ctggatagaa gagcctggct tctcagagct 9600


gcttcagtct 


gttgtgatat 


ctattgtatg 


tcacgtagcc 


tctggaaaac 


tccacagtta 


9660 
gtattgttgg 


gaaaataact 


ttgacctcag gatctcctga aaacgtcttg gggaacccca 


9720 


gggtctagag gctgcagttt gagaactgtt gctgtggtat cccaggtgtc tcaaatactg 


9780 


ppt a naacat 



aoataotact 


cagtaattat tgttgaagga tgaatgaatg aatgaatgaa 


9840 


t naataaato 


aaaaaaagaa 


agaaatgtgt ctttgaatcc agccatgtgc ccagaatgat 


9900 


rrarrapacratlQ 


acaaaaocta 


agggacttta gcatgaggag agggggttcg tttccttttt 


9960 



tttctttttt 


ttttgagatg 


gagtctcact ctactgccca ggctagagtg cagtggtgca 


10020 


atctcaactc 


actgcaatct 


ctgcctcctg agttcaagca attctcctgc ctcagcctcc 


10080 


n ncrt a OC to 


aoactacaqq 


tgcgtgccac catgcctagc taatttttta catttttggt 


10140 


a /-t =3 rr a to aero 


ttttaccata 


ttggccgggc tggtctggaa ctcctgacct caagtgatcc 



10200 




acctcccaaa 


atgttaggat tacaggtgtg agccaccatg tccggccaag 


10260 


=3 rrnnt" crtt ca 


tttctactcc 


ttgccaggta ttgtgtcagg cactggggac ccagcagtgg 


10320 




caoaact eta 
cctcacggag cccacatttt caccaggcaa aggatggtcg 


10380 




t aaaaaataa 



gacttcagca gttgggtggg ggagccgtgg gagaagccca 


10440 


gcccacaggg 


ggacagtgca 


aatctagaac caaggcgatg gcaggggtga ggctggcacg 


10500 


gtagctagag 


accacgtcgt 


accaagggcc ttggggacca tgggactatg ggaccttagg 


10560 


gaaggcgtct 


ggaatgctgt 


agccagacac tgttgcaagg aggatttttc tgtagacatg 


10620 


aggccttcct 


tatgaagaaa 


gcaagggttc tttcattcct gggggtgcca ggtgctgtgg 


10680 


actgcagcac 


gcgtggttgc 


tgccgtcaca gagctgtcat gcaggagggc agcgcgtcct 



10740 


tgggaaggtg gcaggcaggt 


caggctagga ggaaagaggc cgggaagctg agggcatttc 


10800 


ctgcccgaga 


tgcccaatgt 


agcctacttc tgtccccagt ggcttaaggc agagttgcct 


10860 


ggtaggtgcc 


ctggtcccac 


cctagtgaaa ggctgaaggt atttaattag tgcctgagaa 


10920 


gcagagagga 


aacaggatgt 


accaaaacac tttgatggat ggtagagtta acaggctcct 


10980 


tgcctgcagc 


tgcttcagac 


aagagcgtcc ccaagccctg ggcctgacct ggaatgnggg 


11040 


gatggaaggg 


gagggggagg 


aaccaaggca ctgggagggt aagtctctct ctcccacata 


11100 


gacacaccca 


ctccttatgg 


gtgcctgggc atctcctggt acctagaatc tggcctgttt 


11160 


atctccacac 


ccatccctgg 


ggtctacact aggccctgtg ggtggcagtt cacatcaggg 


11220 


gagttctgac 


tttggctctg 


agaggtggtt cagagatggc tgtaagttga gaagcacaga 


11280 


ctgctgggtg 


tggtggttca 


cgcctgtaat cccagcactt tgggaggctg aggtgggggt 


11340 
ggatcacctg 


aggtctggag 


ttcaaaacca 


acttggtcaa 


catggcgaaa 


ctccatctct 


11400 


actaaaaatg 


caaaaattag 


ccaggtgtgg 


tggcaggtgc 


ctataatccc 


agctacatgg 


11460 


gaggctgagg 


caggagaatt 


gcttgaatct gggaggcgaa gattgtagtg agccgagatt 


11520 


agttcgcacc 


attgcatgcc 


agcctgggca 


acaagagtga 


aactccgatt 


caaacaaaaa 


11580 


aaaaaaaaag 


ctgggcatgg 


tggagtgcct 


gtagtcctaa 


ctactcaggt 


gggaggattg 


11640 


cttgagtcca 


ggaggttgaa 


gttgcagtgg 


gctataatta 


caccactgca 


ctccagccag 


11700 


ggccacagag 


tgagaccctg 


tctctaaaga 


aagaaaaaaa 


aaaacaacct 


caggctccga 


11760 


gggcaccatt 


actgctctat 


actgaagagc 


tgtgcagctt 


ttccagaccc 


gaaatgtcat 


11820 


ccacaaaaca 


gaagtgataa 


tggtcctgcc 


tcacagactt 


cttgcagtag 


tccaggtgtt 


11880 


tagaacgggg 


tgtaaaaggc 


cgtgtgccct 


tggtaggaat 


ctttgcatat 


gcatttgatc 


11940 


atctgcagcc 


tgcccagccc 


actgcttgcc 


ccctcctggg 


tgtgctggga 


aggggtcttt 


12000 


ggccctccag 


gggttaggtg 


ccccagcctc 


caaggtgccc 


tcacgccttt 


tcatcccgac 


12060 


tcagatgctg 



acctgacctt 


tgaccagacg 


gcgtgggggg 


acagtggtgt 


gtattactgc 


12120 


tccgtggtct 


cagcccagga 


cctccagggg 


aacaatgagg 


cctacgcaga 


gctcatcgtc 


12180 


cttggtgagt 


gggcctggga 


agggggaggc 


atggcccttc 


cttttgtccg 


cttctgttct 


12240 


gtctgccctc 


ccctgtgtcc 


gccctctgcc 


ctccagctta 


ccctctgggc 


tctgtcgcct 


12300 


gctctgctct 


cccccaggct 


ctgccagtca 


cttaggctcc 


cctgtgccct 


gcaccccagg 


12360 


cagggaccac 


tggcccacag 


tgcctccaat 


cacccaagcc 


aaactaagag 


aagagtggag 


12420 


acaattggag 


actctgcctt 


ttcaaagtct 


catttttaaa 


aaaaatccag 


acttggggtc 


12480 


cgggtgcggt 


agttcatgcc 


tgtaatccca 


gcactttggg 


aggccgaggc 


gggtggatca 


12540 


cttgaggcca 


ggagttcgag 


actagcctgg 


ccaacgtggc 


aaaatcccgt 


ctctataaaa 


12600 


aatataaaag 


ccaggcgtgg 


tggtgcacat 


gcctgtaatc 


ccagttactc 


agaaggctga 


12660 


ggcatgagga 


ttgcttgaac 


ctgggaggca 



gaggatgcag 


taagccaaga 


tcaagccact 


12720 


gcactccagc 


ctgggcgaca 


gagtgagact 


ctgtccaaaa 


aaaaaaaaaa 


tccagacgtg 


12780 


gtcagagtcc 



atgggcagtg 


aatgaggaca 


gttgatggtg 


tgcaaaatcg 


acccacctct 


12840 


tgctacatcc 


ccaaggcctc 


atctcacccg 


agtccctcgc 


caaagcacag 


cggttttgcc 


12900 



gtgtgccctg 


ctgggatggc 


gctgcatggc 


acacacactg 


tgtaagtttg 


agtgcagctg 


12960 


aaacgaagcc 


gattccagac 


acccaggggc 


agggcggggt 


gtccgtgtgg 


ctgggaggcc 


13020 


tccttgtgtt 


agggggatgt 


tgccatcggc 


caggtgccct 


gctgtaagcc 


aacacatgga 


13080 
gtcttgtatg 


acatgtgctc 


tgcatgagtg 


atgccgctgg 


gctgtacact 


gccatcttca 


13140 


catgtgtgaa 


tgagcacgtg 


actggggggt 


acttgggctg 


caagacagag 


u tea ngrg ug 


13200 


ggggatggaa 


cacgtgcacc 


agtgacccag 


gaacctctgc 


ctgttcttcg 


gtaaaatgca 




ccatttgcat 


cagcagttcc 


caaaattagt 


ctccaggtct 


atttacactc 


taaaacatLa 


13320 


tcgagggtct 


ccaagagctt 


ttgtttgttt 


ctgtgggttt 


tatgtctatc 


tgttgcttaa 


13380 


catattagga 


attaaaatgg 


ggagattttc 


cttttttttt 


tttttttttg 


agatggagtc 


13440 


tcgttctgtc 


gcccaggctg 


gagtgcagtg 


gctcgatctc 


ggctcactgc 


aagc l ucacc 


13500 


tcctgggttc 


acgccattct 


cctgcctcag 


cctcccaggt 


agctgggac l 


acaggcaccc 


13560 


gccaccacac 


ccggctaatt 


ttttttgtat 


ttttagtaga 


gac tgggu 


caeca uyita 




gccaggatgg 


tctcgatctc 


ctgacctcgt 


gatccaccca 


cctgggcctc 


ccaaagtgc l 


13680 


gggattacag 


gcatgagcca 


ctgcccggcc 


ttaaaatggg 


gagatttttc 


aagcccaaga 


13740 


tacacaagga 


agactgggca 


acatggcaag 


accctgactc 


tacaaaaaat 


tttaaaaita 




accaggcatg 


gtggcatgca 


cctgtgagcc 


cagcttcttg 


ggaggctgag 


gcaggagrau 


13860 


cgcttgcacc 


caggaggtca 


aggctgcagt 


gagccgtgac 


tatgctactg 


cacucragca 




tgagtgacag 


agaccctggc 


tcaagaaaca 


caaacacaca 


cacacacaca 


cacacgcara 


13980 


tagtccatta 


ggcatcaggg 


cgatgatggc 


atcagggagc 


ctgggaaact 


ctactggaca 


14040 


ttcatgggag 


aacaagtgaa 


aaaggcaaat 


aacatcttag 


tgttattcta 


aaatttcttc 


14100 


ttttggcctt 


gtggacagga 


ccacgctttg 


agagctgtga 


ctgacatgcc 


tctguccLgt: 


14160 


tgcgagggcc 


tatagtgcca 


agtgcatgag 


ctctggggag 


ggcttcgtgg 


gugcagagcr 




gggcctgtgg 


aggcccctca 


gacacaacac 


tggtggggct 


cagagctcca 


ggggcacucg 


14280 


agggaagaca 


agaaccggct 


ctgagatgcg 


tgaatgtgac 


agtgcatgag 


tagagatgga 




gaccttgtgg 


gtcccagaac 


caggactgca 


tatgactttc 


atatgtgggt 


atttt tgeer 


14400 


tcatgggtcc 


cttcctgttt 


taaaaaaaat 


gtgtgattat 


gttgtcacaa 


agagtttatt 


14460 


cctgtatatt 


gtgttaattt 


gtgttcagat 


ttgtaaagta 


aaattaaacc 


atttcagcca 




ggtgtggtga 


cacatgcctg 


tagccctagc 


tacttacccc 


agaggctgag 


gtgggaggat 


14580 


cgcctgagcc 


cacgaggttg 


aagctgcagt 


gagccatgat 


cacacccctg 


cactccagac 


14640 


tgggcgacag 


agctgagatc 


ctatttcgtg 


ggccctaggt 


ccctgtgcct 


gctggaacag 


14700 


gacatcccta 


tcaccgtggt 


tggagccctt 


tggggtgcta 


agacctatga 


atgagggaaa 


14760 
cttagggtgc ccaagctgag gtagagccct cagaaccccc tgggatttgt attggagccc 14820 
tcgtggcata acacaggtgg attatgcaat gggagtttct tacctataag cacccacatg 14880 
tgggcgggtg gagggtagga gccatgcact agggcttcag cccccagccc cttcccgctt 14940 
cagggcacac cttgcacttg gccagcctgg agctgggctt tcgggggtgg cacagcctgg 15000 
gctggctctg gccagcataa tctgtttctc ttttgtccct ccagggagga cctcaggggt 15060 
ggctgagctc ttacctggtt ttcaggcggg gcccatagaa ggtacggggg gtggatcctg 15120 
agttgggctt ctcgggagct cccatacatc acctactgct tctgactcta gttagtatcc 15180 
ccttccccac taaaccctgc tcactgtgga cccctcacta acctggcctg actgtggctc 15240 
tgaggcatct agtggtctgg cgctgggcct aggctaggct gggctgagga gagcctgggg 15300 
tgcaggccag ggctctgtga ctggcacctg cggtgctctt gagggtgtgg cgtctgggca 15360 
gctggctctc tctttggtct gggggctgca gtctgtctcc ctctgtgcag gctgcctcgt 15420 
tttctgcctt gtgttttttg cacctggggg agggccgtaa ctggggaatg gccgggatgg 15480 
tagaatgggg agtgtgctgt gcccagcctc tggcacaaaa aatccagcca gggctgcagg 15540 
ttccttggtg agctttgcaa atcgtccccg acctcagtgc tggctccgca ccatgtaccc 15600 
ctgctgtgcc gttagccctg ttccctccca ggcctccggg ctcagggcct gttgtctttc 15660 
tgcagactgg ctcttcgtgg ttgtggtatg cctggctgcc ttcctcatct tcctcctcct 15720 
gggcatctgc tggtgccagt gctgcccgca cacttgctgc tgctacgtca ggtgcccctg 15780 
ctgcccagac aagtgctgct gccccgaggc ccgtaagtgt cccgctcatg gccaccctgg 15840 
tttgggcaac atcctgcatc caagggaagg aggtggccat ccacctgccc ccaggacagt 15900 
ggcgttggtc tggagggtgt gaatttagcc agtggggaga aagtaggctg aggagggtct 15960 
gctgtttaga ttgtcgttta cttcctccaa cttttagttt atttttattt atgttgttct 16020 
tttcttttgt aagtataatc catacacatg gtaaaaatgt ccaacagtac aagatactag 16080 
tcacatggaa gtaaagccct ctaaaaaaac caaatcttgg ctaggcgcag tgattacgcc 16140 
tgtaatccca gcactttggg aggccaagac gagtggatca cttgaggtca ggagttccag 16200 
atcagcctgg ccaacatggt aaaacccagt tctctactaa aaatacaaaa attagctggg 16260 
catggtggtg atcgcctgta atcccagcta ctcaggagac tgaggcatga gaatcgctta 16320 
aacccaagaa gtggaggttg cagtgagctg agatcacgcc actgcactcc agcctgggcg 16380 
acagagtgag actctgtctc aaaaaaaaaa gaaaaaaaaa tgttaagtga aaaagttaag 16440 
aaaccaaaca aggtttacaa cactacatga tttaagcaaa aaaaattttt tttgttttag 16500 
agaaagggtc tcattctgtc atccaggcag tgcagtgcga tcatagctct ctgcagcctc 16560 

aaactcccgg gttcaagcag tcctcccgcc tcagcctctg gagcagctgg gactgtaggc 16620 

acacaccacc atgcccagct aattttttga tttttgtttt ttgtagagac ggggtctcag 16680 

tatgttgccc agcctgatct caaactcctg gcctcaggtg atcctcccaa gtcagcctcc 16740 

ccaaagtgct gggattacag gcatgtgcca ccatgctggc caatttttaa aaattttctg 16800 

tagagacagg gtcttgctat gttgcccagg ctggtcttga actcttgacc tcaagtgatc 16860 

ctgcctcagg ctcccaaagt gatgggatta caggcatgaa ctaccacacc tggccttaaa 16920 

cttaagcaaa tttttttttt tttttggaga cagtttcact ctgtcgccca ggctggagta 16980 

aagtggcgtg atctctgctc actgcaacct ccgccccccg ggtttaagct attctcctgc 17040 

ctcagcctcc cgagtagctg ggatataggc gcctgccacc acgcctgact aatttttgta 17100 

tttttagtag agacggggtt ttgccatgtt ggccaggctg gtctcgaact cctgacctca 17160 

ggcagtccgc tcccccgcac ccctaccttg gcctcccaaa gtgttaggac tacaggtgtg 17220 

agccaccatg cctggccaaa tttaagcaaa tgtttgaaaa cacataccca caggaatgct 17280 

gcacatttta cccagctact atgtctaggg tcgtatctag cacaccagca tggctactgt 17340 

ggagagctgg gactggatgt gagatgagag ctaaagggga agtaagcaaa ccaagcaggg 17400 

gaaggtaaga gaagacagaa gacagagaga gagggaccta actctatgag aggagtcaga 17460 



catgtgcaat 


tgaaaaagac 


ttgctcctgt 


ctctcttctg 


tgaatgtttg 


tgaatatccc 


17520 


aacgggacac 


tttcacagag 


gagctgattg 


acgtggtcac agccatcagc 


cttgggacac 


17580 


cagaccacag 


tgtgtacact 


aagtggcact 


gatggacact 


tcagcatccc 


tctagctgct 


17640 


gtcccgtttc ccctcctcgg ggaccacagc tgttgccagt ccttggtttc 


cttcaggagg 


17700 


gtgtctgggt 


agaccagcct 


gtgtgcacac 


agtccaagat 


acatgaacag 


tgaagtgcca 


17760 


ggcaatcctt 


gcaagcatgg 


gcaggtggag agctgaggcc 


tgcttgacac 


cttcctgctc 


17820 


agaagcccag 


tgagcagttt 


ccctccctag 


ggctcagtgt 


catcccctat 


aaaatggggc 


17880 


ttatggcaga 


gctcaccaca 


ctgggtgcat 


ctggggattt 


ggcgagctca 


tgtgcacacc 


17940 


attgagcatg 


gggcccaacc 


tatataaaat 


attctacgtc 


tgtcagctgc 


tgggcactgc 


18000 


cactatcagc 


ctcagtagtg 


actgagggac 


agggcaccag 


tcagagccct 


ggtgcacaca 


18060 


gagtgacccc 


agagaagcag 


ccttccctct 


ctgagtcctg 


tttccttctg 


ttaggtcctg 


18120 


acttcatggg 


ttgttgttag 


cattaaggaa 


gtcgctggct 


aattttatag 


tcattgaagt 


18180 
cagtggtgtg 


caacctggtt 


cctcaaagga tcacttccct 


gaaaaaattc 


cactgctccc 


18240 


tggaggctta 


tgcaggccat 


cccatcccct ccctcttgtt 


gtgttcagct 


gacagctttt 


18300 


tgctcagtga 


gtaagtgtta 


ggtccatttc acagatgggc 


tgcaaccaag 


tttgcagtga 


18360 


acccactaag accagagcta gggccaggac taaatgctgg 


tcccaatgcc 


acattcccct 


18420 


gtccccacac 


cacatttcct 


ccatccggag accctgttac 


cccaacccag 


ggccccatta 


18480 


actccctggc 


agaggccctg 


ttacatctgc tgctgccaca 


gcctccgccc 


acccttcagg 


18540 


aggcagcagg tcccactgct 


gatgataaag ttgcaggctg 


cctgagctaa 


tgaaggggct 


18600 


tcctctaggc 


tgtgcactta 


gtcttctgct tccaaaccaa 


atcagaggtg 


aggcaccctc 


18660 


tctgggccca 


tctctctcct 


ccattttcct gttggggtcc 


cagggaggaa 


gccacttgcc 


18720 


tagggcccag gaattttgca agcctcttgc cctagggagg aaggaaggga 


ggaggatctt 


18780 


accttgaact gtcaagccta gagcctggtg gggcaggcag 


aaatgggtgc 


agtccatgag 


18840 


ttagaaacac 


tagaggagac actttgctgc ttggccgggg caggcaagtt 


aattcccgag 


18900 


gctcctgcca ctgcatctca atctggaagg tgaccaggtg ggcaggaccc acgtctccca 


18960 


gatgactcat 


tttttctaga 


acaggggctt ggctgccaaa gaggatactt gatttcggct 


19020 


tgtggggaca 


gtggtggacc 


cagcatctgg gctttatata 


aagggcagct 


ttgttgccct 


19080 


gtaaacacac 


agaccatggg 


tggccacttc ttccagtaag ttagctgggg 


agttggaagt 


19140 


ttaggtaaaa 


ccttttgatt 


gacaaatgtt ggcgaattac 


catgctgtta 


aatgaaacat 


19200 


tgttctgcca 


ccctggggct 


gtgggtgcct gcgtgcaccc tctgaaaaat 


cacacaggaa 


19260 


gtggggtggg 


gtctctgtga 


agctggtgtc ccccagcctc agggatgctg 


cagaaatgga 


19320 


atgaggacca 


acagggactc 


agatgtccaa ggaagctcta 


cagcggagag 


gacggcttgg 


19380 


gaaggaggtc 


caggcccagg 


tccctccgga acccaatggg 


tatggggcag 


cctggctcct 


19440 


gcctcatccc 


ccttctcctg 


ttgattgtgt cctcacagtg 


tatgccgccg 


gcaaagcagc 


19500 


cacctcaggt 


gttcccagca 


tttatgcccc cagcacctat 


gcccacctgt 


ctcccgccaa 


19560 


gaccccaccc 


ccaccagcta 


tgattcccat gggccctgcc tacaacgggt 


accctggagg 


19620 


ataccctgga 


gacgttgaca 


ggagtagctc aggtgaggcc 


gggggaagca 


ggaacagctg 


19680 


gtgggagtgt 


gctgggcatc 


tggacactga ggggcagggg 


ctggaaggaa 


gagtgtcttg 


19740 


ggagccgagg 


aggggctctg 


ctcctggtgc gcggccactg 


acagccactc 


tcccccagct 


19800 


ggtggccaag 


gctcctatgt 


acccctgctt cgggacacgg 


acagcagtgt 


ggcctctggt 


19860 


gagaatccat 


cgtcccgaag 


ttggatgtgc ctgtaaggga 


gaggggtggg 


ccaggatcca 


19920 
tcctcccaaa 


ccgaccacca 


cccccctgtc cctagaagtc 


cgcagtggct 


acaggattca 


19980 


ggccagccag 


caggacgact 


ccatgcgggt cctgtactac 


atggagaagg 


agctggccaa 


20040 


cttcgaccct tctcgacctg gcccccccag tggccgtgtg gagcggggta 


agcaggagcc 


20100 


ttaaagtcta 


agggctttta 



aggtgggggg gtgaaacatg tctccctgat 


acctgccgca 


20160 


gggactcttg 


gtgcaaaccc 



tggaccccgg gctcctccag 


cagtcagtga 


cacccccctt 


20220 


ccctcrcaQcc 


atgagtgaag 



tcacctccct ccacgaggac 


gactggcgat 


ctcggccttc 


20280 


ccggggccct 


gccctcaccc 


cgatccggga tgaggagtgg 


ggtggccact 


ccccccggag 


20340 


tcccagggga 


tgggaccagg 



agcccgccag ggagcaggca 


ggcgggggct 


ggcgggccag 


20400 


acaaccccgg 


gcccgctccg 


tggacgccct ggacgacctc 


accccgccga 


gcaccgccga 


20460 


atcaaaaaac 


aggtctccca 


cgagtaatgg tgggagaagc 


cgggcctaca 


tgcccccgcg 


20520 


aaoccqcaqc 


cgggacgacc 



tctatgacca agacgactcg agggacttcc cacgctcccg 


20580 


acacccccac 



tacgacgact 


tcaggtctcg ggagcgccct 


cctgccgacc 


ccaggtccca 


20640 


ccaccaccat 


acccgggacc 


ctcgggacaa cggctccagg 


tccggggacc 


tcccctatga 


20700 


taaacaacta 


ctggaggagg 


ctgtgaggaa gaaggggtcg 


gaggagagga 


ggagacccca 


20760 


caaagaggag 


gaggaagagg 


cctactaccc gcccgcgccg 


cccccgtact 


cggagaccga 


20820 


ctccrcaqqcq 


tcccgagagc 


gcaggctcaa gaaggtgagg 


gccgccctcc 


ctggcgtcca 


20880 


aaccatccct 



aggcccccag 


ccggtccccg cggctcatac 


ccttctttct 


ttctcccttg 


20940 


raaaacttao 



ccctgagtcg 


ggaaagttta gtcgtctgat 


ctgacgtttt 


ctacgtagct 


21000 


tttatatttt 


tttttttaat 


ttgaaggaac actgatgaag 


ccctgccata 


cccctcccga 


21060 


crt* ctaataaa 


acgtataatc 


acaagctctg gagagaacca 


tttgttcggc 


cgcgcggggc 


21120 




ggctgctccc 


gtatgcgtct gtaaagcgcc 


gcgtcccggg 


ggcaccggag 


21180 


tccggggccg 


ggaggaagag 


acccagcctg gcccggcccg 


cgcccgcgcc gccggccgga 


21240 


gaacgtgccc 


cgcgcagccg 


ccgcccgcct gcgtgcgcgc 


cccggccccg 


cccaggcgtg 


21300 


cgcatgcgcc 


ccggccctcc gccttcgcgc accgcaggct ggccgtccgg gacgcgcgcg 


21360 


cgctcctctc 


cccttccagc 


ccatcccccc cagcccccca 


ccgacctact 


ttactgtctc 


21420 


caaactcggg 


cagcccacct 


ggcccccgac gaccccagcc 


cctgcaccgg 


gtaccccgac 


21480 


gttccatcca 


gacccgcgtt 


tcaccagggc ggcgcgcggc 


gacctcgcgc 


cccgcggagc 


21540 


cccgggctcg 


cgcgcgcccg 


cccgcccccg gagacagacc 


agcgcgcgcg 


ccccgggccg 


21600 






cctcccccca gcgcgcgtcc gccccggggc tcgcgccgcc gccgccgccg ccgccgcgcg 21660 

cgcgcagctc aagtaaagga ggaaaaaaaa aagggggaaa aatagaaagc ggcggcggct 21720 

gcagcagcga tccgccgccg gactgggcca agccgggcgg cggccgcgcg agccggcgat 21780 

ccagggcact ggcggcggcc agccagggcg ggccgtgttc aaaaaaaaaa gtcgcggcgg 21840 

cggcggctgc tcagggaagg aggcctgagg gccgcgtgca gcgggcgggc agctgggtgg 21900 

gctgggggcg gccgcgcggc gtcccggagc ctcgggccgc ccggagccgg cgggcgggcg 21960 

gaggcggagg cggcggcggc tgcagcggct gcaggagcgg cggcggctgc ggcggcggcg 22020 

gcggcatctc ctcctcacat gaccccactg tttgtccccg tgatcagcgc gagcggctcc 22080 

cgtatctcct ccgtcccctc ctgccgcgcg gcgtgagcgc cgggctcggg gcccccccgg 22140 

ccgcccgccc cctcccctcc ctccctcccc tcccctcccc tcccccccgg gccccgcgcc 22200 

ccccccgccc ccgccccccc catggacatg ctggacccgg gtctggatcc cgctgcctcg 22260 

gccaccgctg ctgccgccgc caggtaagat ccccggcccg gccgtgcccc cgcgccccgg 22320 

ccccggcccc ggccccgcgg cctgcaggcc ggggccgcca tgatcccgag cggccgcggg 22380 

cccggctcaa aatggaggcc gccggcgcgg gggggacctg gcgcctcccg cccccggccc 22440 

ccggcctcgg cggcgccccc ggcctcaggc gcggccgggt gggactgggg ccctgcagct 22500 

gggcgcgggg gcgggggcgc gggcgcgggc cgcgctgacc ctgctccctc ctgtgcccct 22560 

ggcagccacg acaagggacc cgaggcggag gagggcgtcg agctgcagga aggtgagtgc 22620 

ttgccgggcc ggccgcgccc ggggagggct gggggcgctc ggcgcggccc tgaccgtgcc 22680 

ccgaccctcc tcggccccag gcggggacgg cccaggagcg gaggagcaga cagcggtggc 22740 

catcaccagc gtccagcagg cggcgttcgg cgaccacaac atccagtacc agttccgcac 22800 

agagacaaat ggaggacagg tgagcggcgg gccgcgaggg cgaacgggcg ggcgggcggg 22860 

cgcgccggga aggctcggac ctggccccag cgccggcctc gccgctctgc cgccccctgc 22920 
aggtgacata ccgcgtagtc caggtgactg atggtcagct ggacggccag ggcgac 22976 



<210> 20 
<211> 30 
<212> DNA 

<213> Rattus norvegicus 



<400> 20 

gttacagaat tcgccgcgat ggcgccggcg 



30 
<210> 21 
<211> 20 
<212> DNA 

<213> Rattus norvegicus 
<400> 21 

gccaggacag tgtacgcact 



20 



<210> 22 
<211> 20 
<212> DNA 

<213> Rattus norvegicus 
<400> 22 

acctcaggtg tcccgagcat 



20 



<210> 23 
<211> 20 
<212> DNA 

<213> Rattus norvegicus 
<400> 23 

gaagatgact ggcgatcgag 



20 



<210> 24 
<211> 20 
<212> DNA 

<213> Rattus norvegicus 
<400> 24 

acctctatga cccggacgat 



20 



<210> 25 
<211> 20 
<212> DNA 

<213> Rattus norvegicus 
<400> 25 

caccaccctg acagtgcgta 



20 



<210> 26 
<211> 20 
<212> DNA 

<213> Rattus norvegicus 
<400> 26 

ctgggggcat agatgctcgg 
<210> 27 
<211> 20 
<212> DNA 

<213> Rattus norvegicus 
<400> 27 

gccctggaag gcctcgatcg 



<210> 28 
<211> 20 
<212> DNA 

<213> Rattus norvegicus 
<400> 28 

caagtcccta ggatcgtccg 



<210> 29 
<211> 20 
<212> PRT 

<213> Rattus norvegicus 
<220> 

<221> Misc_Feature 

<222> 169. .188 

<223> Position in SEQID2 

<400> 29 

Ser Ala Gln Asp Leu Asp Gly Asn Asn Glu Ala Tyr Ala Glu Leu Ile 
1 5 10 15 

Val Leu Gly Arg 

20 



<210> 30 
<211> 15 
<212> PRT 

<213> Rattus norvegicus 
<220> 

<221> Misc_Feature 

<222> 556. .570 

<223> Position in SEQID2 

<400> 30 

Glu Glu Gly Gln Tyr Pro Pro Ala Pro Pro Pro Tyr Ser Glu Thr 
1 5 10 15 






<210> 31 

<211> 21 

<212> DNA 

<213> Homo sapiens 

<400> 31 

atgcaacagg acggacttgg a 



<210> 32 

<211> 28 

<212> DNA 

<213> Homo sapiens 

<400> 32 

tcagacgact aaactttccc gactcagg 



<210> 33 
<211> 20 
<212> DNA 
<213> Homo sapiens 

<400> 33 
ctacaacccc tacgttgagt 



<210> 34 

<211> 24 

<212> DNA 

<213> Homo sapiens 

<400> 34 

tcgtgacctg acctttgacc agac 



<210> 35 

<211> 24 

<212> DNA 

<213> Homo sapiens 

<400> 35 

cctgagctac tcctgtcaac gtct 



<210> 36 

<211> 20 

<212> DNA 

<213> Homo sapiens 






<400> 36 

aggccgagat cgccagtcgt 



<210> 37 
<211> 27 
<212> DNA 

<213> Homo sapiens 
<400> 37 

ctacatggat ccagtcatgc cgaagat 



<210> 38 
<211> 28 
<212> DNA 

<213> Homo sapiens 
<400> 38 

cgacaactcg agtcagttgg tatcatgg 



<210> 39 

<211> 15 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> Misc_Feature 
<222> 1. .14 

<223> Identical to 5 ... 18 in ref swissprot :Q07021 
<400> 39 

Leu Arg Cys Val Pro Arg Val Leu Gly Ser Ser Val Ala Gly Tyr 
1 5 10 15 



<210> 40 
<211> 16 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> Misc_Feature 
<222> 2. .15 

<223> Identical to 268 ... 282 in ref swissprot :Q07021 
<400> 40 

Cys Tyr Ile Thr Phe Leu Glu Asp Leu Lys Ser Phe Val Lys Ser 
1 5 10 15 
<210> 41 

<211> 21721 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> exon 
<222> 1898. .2253 

<221> exon 
<222> 3438. .3782 

<221> exon 

<222> 12064. .12183 

<221> exon 

<222> 15049. .15105 

<221> exon 

<222> 15670. .15816 

<221> exon 

<222> 19486. .19659 

<221> exon 

<222> 19806. .19865 

<221> exon 

<222> 19963. .20094 

<221> exon 

<222> 20236. .20864 

<221> exon 

<222> 20954. .21094 

<221> Misc_Feature 

<222> 715 

<223> diverging nucleotide, G in ref genbank: AC002128 

<221> Misc_Feature 
<222> 1229 

<223> diverging insertion, G in ref genbank : AC002128 

<221> Misc_Feature 
<222> 3676 

<223> diverging nucleotide, T in ref genbank: AC002128 

<221> Misc_Feature 
<222> 5039 

<223> diverging deletion, G in ref genbank: AD000684 

<221> Misc_Feature 
<222> 5118 



<223> diverging nucleotide, C in ref genbank: AD000684 

<221> Misc_Feature 
<222> 7337 

<223> diverging deletion, C in ref genbank: AD000684 

<221> Misc_Feature 
<222> 8294 

<223> diverging nucleotide, G in ref genbank: AD000684 

<221> Misc_Feature 
<222> 8604 

<223> diverging nucleotide, C in ref genbank: AD000684 

<221> Misc_Feature 
<222> 8928 

<223> diverging nucleotide, A in ref genbank: AD000684 

<221> Misc_Feature 
<222> 9021 

<223> diverging nucleotide, G in ref genbank: AD000684 

<221> Misc_Feature 
<222> 9851 

<223> diverging insertion, GAATGAAA in ref genbank:AD0 

<221> Misc_Feature 
<222> 9878 

<223> diverging nucleotide, C in ref genbank: AD000684 

<221> Misc_Feature 
<222> 11478 

<223> diverging nucleotide, T in ref genbank: AD000684 

<221> Misc_Feature 
<222> 11577 

<223> diverging deletion, C in ref genbank: AD000684 

<221> Misc_Feature 
<222> 11779 

<223> diverging nucleotide, T in ref genbank: AD000684 

<221> Misc_Feature 
<222> 13411 

<223> diverging deletion, T in ref genbank: AD000684 

<221> Misc_Feature 
<222> 13538 

<223> diverging nucleotide, G in ref genbank: AD000684 

<221> Misc_Feature 
<222> 13896 

<223> diverging nucleotide, G in ref genbank: AD000684 

<221> Misc_Feature 
<222> 14912 

<223> diverging nucleotide, A in ref genbank: AD000684 
<221> Misc_Feature 
<222> 16732 

<223> diverging nucleotide, C in ref genbank: AD000684 

<221> Misc_Feature 
<222> 17169 

<223> diverging nucleotide, G in ref genbank: AD000684 

<221> Misc_Feature 
<222> 18946 

<223> diverging deletion, G in ref genbank: AD000684 

<221> Misc_Feature 
<222> 19474 

<223> diverging nucleotide, G in ref genbank: AD000684 

<221> Misc_Feature 
<222> 20500 

<223> diverging deletion, G in ref genbank: AD000684 

<221> Misc_Feature 
<222> 20501 

<223> diverging deletion, G in ref genbank: AD000684 

<221> Misc_Feature 
<222> 20502 

<223> diverging deletion, A in ref genbank: AD000684 

<221> Misc_Feature 
<222> 21270 

<223> diverging nucleotide, G in ref genbank: AD000684 

<221> Misc_Feature 
<222> 21356 

<223> diverging insertion, T in ref genbank: AD000684 

<221> Misc_Feature 
<222> 21476 

<223> diverging nucleotide, A in ref genbank: AD000684 

<221> Misc_Feature 
<222> 21588 

<223> diverging insertion, C in ref genbank: AD000684 

<221> Misc_Feature 
<222> 21601 

<223> diverging deletion, T in ref genbank: AD000684 

<221> Misc_Feature 
<222> 21635 

<223> diverging insertion, G in ref genbank: AD000684 

<221> Misc_Feature 
<222> 19963. .19965 

<223> Potential variant splicing site AAG 



<221> Misc_Feature 
<222> 1. .21721 
<223> n= a,g,c or t 

<400> 41 



aacagtttgg 




aaa ? y L. Laaa 


flat* ansarta 


ccaaotcacc 


caacaattcc 


60 


attcttaggc 


dLaldLLLda 


el d y a ci ci lyaa 


arrpacia1~att 


tatacaccaa 


tattcacagc 


120 


tgcactattt 


acaatagtca 


2i2a.arTrr"t~arraa 
daayy Lay a a 


araacctaoa 


tccatccaca 


aatgaatgga 


180 


taaataaaac 


gtagcatata 


catacaatgg 


tacactagtc 


cactotaaaa 


acraaatttta 


240 


atcttactgc 


atgctacatg 


gcttcgacat 


actacaacat 


aoataoacct 



tgaaaacatt 


300 


attctttgtg 


aaataaacta 


gacacaggac 


aaatgttaga 


caattccact 


tatatgaggc 


360 


acctagaatg 


ggcaatttgg 


taagcaaagt 


agaatagaaa 


ttactagagg 


cacaggtagc 


420 


agggaatggg gagttactgt 


ttaatggtca 


cagagtttat 




atgaaacagt 


480 


ttcggggata aagagtggtg actggtacac gacattgtga 


atatacttaa 


tgccactgaa 


540 


ttttacactt 


gaagtggtta 



aagcgataaa 


tattatagnt 


ttacatattt 


tatcataaaa 


600 


atattttttt 


aaacgatgaa 


gggacgtgaa 



cgggttgaaa 


ttttataaaa 


agtggccagg 



660 


gaaggtgtca 


ctgcaatggt 


gtcctacagg 


aggaggaaga 


tcatatacac 


atctccggga 


720 


agggtgttct 


ggcagaggga 


gtagcacggg 


cgatggctct 


rra ncract Qt Q 



aaaagtatag 


780 


ttggaaacag 


cgaggaggcc 


agggtgtccg 


aagctgagta 


acrccaoaaacr 


aatgggagga 



840 


ggtgagataa 


gagggggaag 


gtcagtttct 


gctgagagtg 


^ y ^ >^ ^ w 


acaggagggc 



900 


tgtgagcagg 


tggacgtgat 


ctggcttgag 


ttttaacagg 


rrppacrtacraa 



caaaocacoc 


960 


ctgggtaccg 


aaaccagcca 


ctggccagtt 


ggcaacctgg 


rrnrra crt" c t a a 


cocoaQaaacj 


1020 


cgcccagggt 


tcccccagga 


tgcgctttcc 


ctcgccgcca 


fpf a era era ca 


acaaaot cac 


1080 


gcccagcgct 


gcgcaggctg 


atcgccgcgc 


cgcgcccccg 


rrrtcaatca 


caaot acrctc 


1140 


gttccgggaa 


ttcctaagcg 


gaaaccggtc 


ccaagccccg 




raaccccttt 



1200 


aagagccaga 


atttccggag 


ggctgacccg 


gggctaggga 


tgcccagggg 


ccgaaccaca 


1260 


agttgggaac 


gggtggggga 


ggtggcgaaa 


acttccgaag 


tggaattcca 


acttttcctg 


1320 


gccctgattc 


cccttgggca 


tccctgangg 


ggcagagctt 


cccttccggg 


gactttagag 


1380 


ggttcctcag 


gtcatctaac 


tgggagacac 


aggaggcccg 


aagcgccccc 


cctccacccg 


1440 


gtccggagga 


accccagtgg 


aagtggagaa 


gtcaggcgcc 


accaacaagc 


ctctcccagc 


1500 


caggactttg 


cttagactcg 


ctcctcccgg 


cagggcgcac 


ctaggcgggt 


ccatcgccag 


1560 
ccggggagag 


gggtttgggc agggagggaa 


caggtgcgcg 


gcgggacccg 


ccctatctca 


1620 


acaggtgaat 


cgctccaagt gggtctcggt 


tgcatggatc 


tcggtgcgct 



tggtttggcc 


1680 


ggagcagatg 


ggggccggaa 


gggacctgtg 


gtccgcaggc 


gccctcccag 


cgggccagtc 


1740 


acttggttcg 


ggccctgggg gacggagcgc 


acctgggtca 


gcccacttcc 


ggggagggag 


1800 


gcagaggaac 


ccctccccgc 


cgctcacccc 


taagcccagc 


cctcggctcc 


cacccttgtg 


1860 


tacctgggcc 


gaaccattca 


ccggagcgcg 


cagcgggtgg 


agtgtggctc 


ggaggaccgc 


1920 


ggcgggtcaa 


gcacctttct 


cccccatatc 


tgaaagcatg 


ccctttgtcc 


acgtcgttta 


1980 


cgctcattaa 


aacttccaga 


atgcaacagg 


acggacttgg 


agtagggaca 


aggaacggaa 


2040 


gtgggaaggg 


gaggagcgtg 


cacccctcct 


ggccttggtg 


cgcgccgcgc 


cccctaaggt 


2100 


actttggaag 


ggacgcgcgg 


gccagacgcg 


cccagacggc 


cgcgatggcg 


ctgttggccg 


2160 


gcgggctctc 


cagagggctg 


ggctcccacc 


cggccgccgc 


aggccgggac 


gcggtcgtct 


2220 


tcgtgtggct 


tctgcttagc 


acctggtgca 


caggtacggg 


gcacggggcc 


tctgacgctg 


2280 


cggaacgccg 


gagggaactg tagaggggga tggatggagt 


tggaggcggc 


gggaagcggg 


2340 


aagcgggggt 


ctcagaggct 


gggaccttcc 


gatcccctgg 


gtcttgggcg 


atctgttgcg 


2400 


ncgcgggagt 


gagaggaatt 


ccccatttgt 


gccggggagc 


gctccccgcg 


cccttatctg 


2460 


gaagatagca 


ggaagtgaaa 


ctccctggac 


ggtgagaccc 


ggagcggcag 


ggagaatgga 


2520 


actctttgtg 


gggagggagt 


ggaagaccgc 


ccgatctctg 


ggaaaagaaa 


agccgggatg 


2580 


ggacttgggc 


gcacccgggg 


atttctaagt 


tttggagtaa 


cggggagagg 


gcacgggagg 


2640 


gctggatcag 


acgcttccta 


gagggacaga 


gacgaaggaa 


caatgcctag 


gcctcgggtg 


2700 


ggtgtgggac 


tggggactcc 


ccatcccccg 


caccccaccc 


acctcccgcg 


ggctccggat 


2760 


tatacgtgcg 


taagagtctg 


gtgggatgga 


tttacggact 


tgaaaccgac 


ttctgctggc 


2820 


aggctttcac 


ctggatggga 


tatttgggtg 


gtgatgaggt 


ctttcccgag 


acacttttgg 


2880 


ttcagtcatt 


tgaaatgact 


ttagagtagg 


gtgaggtggt 


gggaggctga 


tggagatatt 


2940 


gtgggggctt 


tagtccctcc 


atggcaaagc 


agttcaggca 


aacaactcca 


tggttttccc 


3000 


tccaaattca 


aaaggccccg 


ggtaacctgg 


aatccttcgt 


agtcggtttt 


gaagtggggc 


3060 


cttgggcgct 


gggggcatca 


acatggccat 


ctgggcttgc 


ctgcccaggc 


cacacagagg 


3120 


ccccttgttg 


tgggtgaatg 


gcaaagggaa 


gaggggactg 


gtgtggttca 


gaggccacag 


3180 


gctgggaaga 


gggatggcgg 


gcgagtccaa 


ggaaactggc 


cgtgtcaccg 


tgcacctgcc 


3240 


acttcagccc 


cacgggtcta 


taaaatgggc 


atgattatcg 


tggctacctc 


actggtcctg 


3300 
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* 



gcaattaagg aacaatgtgt gccaggcact ctgtaaacca catacttgcg agtgtcaagc 3360 

tggtgacagg tggcgttcct gttgaagcac ctccctgagc tcacagcaac ccttgctgtc 3420 

tctcctcttg ccctcagctc ctgccagggc catccaggtg accgtgtcca acccctacca 3480 

cgtggtgatc ctcttccagc ctgtgaccct gccctgtacc taccagatga cctcgacccc 3540 

cacgcaaccc atcgtcatct ggaagtacaa gtctttctgc cgggaccgca tcgccgatgc 3600 

cttctccccg gccagcgtcg acaaccagct caatgcccag ctggcagccg ggaacccagg 3660 

ctacaacccc tacgtcgagt gccaggacag cgtgcgcacc gtcagggtcg tggccaccaa 3720 

gcagggcaac gctgtgaccc tgggagatta ctaccagggc cggaggatta ccatcaccgg 3780 

aagtatgttg ggcagggcag ggggatgagg ctgggcttgc ccgggtggtg ggactggcgt 3840 
ccttgtgcgg gacctggagt ccccatctga aagctcttga gtgccagtgt ctgaaaggac 3900 

cattgaaggg agcaattctt tttttttttt tttttgaaga tggagtcttg ctctggactc 3960 

caggctggag tgcagtggtg cgatctcagc tcactgcaac ctccacctcc caggttcaag 4020 

caattctctt gcctcagcct cccgagtagc tgggactcca ggtgcgtgcc accacgccca 4080 

gttaattttt gtatttttag tagagatggg gtttcaccat gttggccagg ctggtctcaa 4140 



actcctgacc tcaaatgatc tgcccgcctt ggcctcgcaa agtgctgaga gacaccatac 4200 

ccagcctaaa gggagcgatt ctattctact attcttcctt ctgctaatcc ttccattctt 4260 

taatttaata acgaagattt tttgagtacc tgtcatatac caggtgctgt tctgggccct 4320 

gggaatacag ctgttaacaa aatcatcaaa ccacttccct cgtggagccc acattgcagt 4380 

gagagagaca aacacgacac acactctcaa gtccttgaag ataaagaaaa ctgggtaacg 4440 

gagagaagag gccagggttt gttctataat cattaataac acgagcagta agaagtaaaa 4500 
tttatctaag taacaactta taaagggtct actgtgtgct aagctctcat ccaggttccc 4560 
aaggattaac tcagaccaca cagtaattga atagattcta tcattgtcat cttacagagg 4620 
cccagagaga gaaagtgact tgcctagtgt catagctggt aacggggctg ggattctaac 4680 
tcagccactt tgggtctagt ggccaagctc ctaatccctt tgcttgccta gggtggtccg 4740 
cagaggactc acagaggaga tggcaggagt gaactgcagg ggcaagagag cttaatggag 4800 
aaagcctgtg acatgccagg aactgcacac atattctccc attgagtcct ctcctctacc 4860 
ctcctgacag ctgaggcaca gagaggttac cttgttcaaa tgggtgcata ggaagtcaaa 4920 
gtctggagct ggggtttgaa cccaggcagc cctgagaacc ttgttctttt tttttnannc 4980 
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ggagtctcgc 


tctgtcgccc 


aggctggagt 


gcagtggcgg 


gatctcggct 


cactgcaagc 


5040 


tccgcctccc 


gggttcacgc 


cattctcctg 


cctcagcctc 


ccaagtagct 


gggactacag 


5100 


gcgcccgcca 


ctacgcctgg 


ctaatttttt 


gtatttttag 


tagagacggg 


gtttcaccgt 


5160 


tttagccggg 


atggtctcga 


tctcctgacc 


tcgtgatccg 


cccgcctcgg 


cctcccaaag 


5220 


tgctgggatt 


acaggcgtga 


gccaccgcgc 


ccggcccctt 


gttcttaact 


gtaatgctgc 


5280 


ctcctgatag 


gatgtgcctg 


ttgggactaa 


gtaaggggca 


gtcattcatt 


cattcatttg 


5340 


gtatttatca 


agcatcgact 


atgtgtcgtt 


ggtgctgggg 


atagaggtga 


ttgggatggc 


5400 


tgaagtttct 


gtcgtcaagg 


agatgacatt 


ctggtggagt 


nagactggca 


gtaaatnaag 


5460 



cagataaaga aagagtatga gaatttcaaa gtctgggcac ggtggctcac gtctgtaatc 


5520 


tcagcacttt 


gggaggccaa 


ggtgggtgga 


tcacctgagg tcaggagttc 


cagaccagcc 


5580 


tggccaacat 
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ccactaaacc 


ctgctcactg 


tggacccctc 


actaacctgg 


cctgactgtg 


15240 
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gctctgaggc 


atctagtggt 


ctggcgctgg 


gcctaggcta 


ggctgggctg 


aggagagcct 


15300 


ggggtgcagg 


ccagggctct 


gtgactggca 


cctgcggtgc 


tcttgagggt 


gtggcgtctg 


15360 


ggcagctggc tctctctttg gtctgggggc tgcagtctgt 


ctccctctgt 


gcaggctgcc 


15420 


tcgttttctg ccttgtgttt 


tttgcacctg 


ggggagggcc 


gtaactgggg 


aatggccggg 


15480 


atggtagaat 


ggggagtgtg 


ctgtgcccag 


cctctggcac 


aaaaaatcca 


gccagggctg 


15540 


caggttcctt 


ggtgagcttt 


gcaaatcgtc 


cccgacctca 


gtgctggctc 


cgcaccatgt 


15600 


acccctgctg 


tgccgttagc 


cctgttccct 


cccaggcctc 


cgggctcagg 


gcctgttgtc 


15660 


tttctgcaga 


ctggctcttc 


gtggttgtgg 


tatgcctggc 


tgccttcctc 


atcttcctcc 


15720 


tcctgggcat 


ctgctggtgc 


cagtgctgcc 


cgcacacttg 


ctgctgctac 


gtcaggtgcc 


15780 


cctgctgccc 


agacaagtgc 


tgctgccccg 


aggcccgtaa 


gtgtcccgct 


catggccacc 


15840 


ctggtttggg 


caacatcctg 


catccaaggg 


aaggaggtgg 


ccatccacct 


gcccccagga 


15900 


cagtggcgtt 


ggtctggagg 


gtgtgaattt 


agccagtggg 


gagaaagtag 


gctgaggagg 


15960 


gtctgctgtt 


tagattgtcg 


tttacttcct 


ccaactttta 


gtttattttt 


atttatgttg 


16020 


ttcttttctt 


ttgtaagtat 


aatccataca 


catggtaaaa 


atgtccaaca 


gtacaagata 


16080 


ctagtcacat 


ggaagtaaag 


ccctctaaaa 


aaaccaaatc 


ttggctaggc 


gcagtgatta 


16140 


cgcctgtaat 


cccagcactt 


tgggaggcca 


agacgagtgg 


atcacttgag 


gtcaggagtt 


16200 


ccagatcagc 


ctggccaaca 


tggtaaaacc 


cagttctcta 


ctaaaaatac 


aaaaattagc 


16260 


tgggcatggt 


ggtgatcgcc 


tgtaatccca 


gctactcagg 


agactgaggc 


atgagaatcg 


16320 


cttaaaccca 


agaagtggag 


gttgcagtga 


gctgagatca 


cgccactgca 


ctccagcctg 


16380 


ggcgacagag 


tgagactctg 


tctcaaaaaa 


aaaagaaaaa 


aaaatgttaa 


gtgaaaaagt 


16440 


taagaaacca 


aacaaggttt 


acaacactac 


atgatttaag 


caaaaaaaat 


tttttttgtt 


16500 


ttagagaaag 


ggtctcattc 


tgtcatccag 


gcagtgcagt 


gcgatcatag 


ctctctgcag 


16560 


cctcaaactc 


ccgggttcaa 


gcagtcctcc 


cgcctcagcc 


tctggagcag 


ctgggactgt 


16620 


aggcacacac 


caccatgccc 


agctaatttt 


ttgatttttg 


ttttttgtag 


agacggggtc 


16680 


tcagtatgtt 


gcccagcctg 


atctcaaact 


cctggcctca 


ggtgatcctc 


cgaagtcagc 


16740 


ctccccaaag 


tgctgggatt 


acaggcatgt 


gccaccatgc 


tggccaattt 


ttaaaaattt 


16800 


tctgtagaga 


cagggtcttg 


ctatgttgcc 


caggctggtc 


ttgaactctt 


gacctcaagt 


16860 


gatcctgcct 


caggctccca 


aagtgatggg 


attacaggca 


tgaactacca 


cacctggcct 


16920 


taaacttaag 


caaatttttt 


tttttttttg 


gagacagttt 


cactctgtcg 


cccaggctgg 


16980 
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agtaaagtgg 


cgtgatctct 


gctcactgca 


acctccgccc 


cccgggttta 


agctattctc 


17040 


ctgcctcagc 


ctcccgagta 


gctgggatat 


aggcgcctgc 


caccacgcct 


gactaatttt 


17100 


tgtattttta gtagagacgg ggttttgcca tgttggccag gctggtctcg aactcctgac 


17160 


ctcaggcaat 


ccgctccccc 


gcacccctac 


cttggcctcc 


caaagtgtta 


ggactacagg 


17220 


tgtgagccac 


catgcctggc 


caaatttaag 


caaatgtttg 


aaaacacata 


cccacaggaa 


17280 


tgctgcacat 


tttacccagc 


tactatgtct 


agggtcgtat 


ctagcacacc 


agcatggcta 


17340 


ctgtggagag 


ctgggactgg 


atgtgagatg 


agagctaaag 


gggaagtaag 


caaaccaagc 


17400 


aggggaaggt 


aagagaagac 


agaagacaga 


gagagaggga 


cctaactcta 


tgagaggagt 


17460 


cagacatgtg 


caattgaaaa 


agacttgctc 


ctgtctctct 


tctgtgaatg 


tttgtgaata 


17520 


tcccaacggg 


acactttcac 


agaggagctg 


attgacgtgg 


tcacagccat 


cagccttggg 


17580 


acaccagacc 


acagtgtgta 


cactaagtgg 


cactgatgga 


cacttcagca 


tccctctagc 


17640 


tgctgtcccg 


tttcccctcc 


tcggggacca 


cagctgttgc 


cagtccttgg 


tttccttcag 


17700 


gagggtgtct 


gggtagacca 


gcctgtgtgc 


acacagtcca 


agatacatga 


acagtgaagt 


17760 


gccaggcaat 


ccttgcaagc 


atgggcaggt 


ggagagctga 


ggcctgcttg 


acaccttcct 


17820 


gctcagaagc 


ccagtgagca 


gtttccctcc 


ctagggctca 


gtgtcatccc 


ctataaaatg 


17880 


gggcttatgg 


cagagctcac 


cacactgggt 


gcatctgggg 


atttggcgag 


ctcatgtgca 


17940 


caccattgag 


catggggccc 


aacctatata 


aaatattcta 


cgtctgtcag 


ctgctgggca 


18000 


ctgccactat 


cagcctcagt 


agtgactgag 


ggacagggca 


ccagtcagag 


ccctggtgca 


18060 


cacagagtga 


ccccagagaa 


gcagccttcc 


ctctctgagt 


cctgtttcct 


tctgttaggt 


18120 


cctgacttca 


tgggttgttg 


ttagcattaa 


ggaagtcgct 


ggctaatttt 


atagtcattg 


18180 


aagtcagtgg 



tgtgcaacct 


ggttcctcaa 


aggatcactt 


ccctgaaaaa 


attccactgc 


18240 


tccctggagg 


ct iia ugcagg 
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aactaacaoc 


18300 


tttttgctca 


gtgagtaagt 


gttaggtcca 


tttcacagat 


gggctgcaac 


caagtttgca 


18360 


gtgaacccac 


taagaccaga 


gctagggcca 


ggactaaatg 


ctggtcccaa 


tgccacattc 


18420 


ccctgtcccc 


acaccacatt 


tcctccatcc 


ggagaccctg 


ttaccccaac 


ccagggcccc 


18480 


attaactccc 


tggcagaggc 


cctgttacat 


ctgctgctgc 


cacagcctcc 


gcccaccctt 


18540 


caggaggcag 


caggtcccac 


tgctgatgat 


aaagttgcag 


gctgcctgag 


ctaatgaagg 


18600 


ggcttcctct 


aggctgtgca 


cttagtcttc 


tgcttccaaa 


ccaaatcaga 


ggtgaggcac 


18660 
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cctctctggg 


cccatctctc 


tcctccattt 


tcctgttggg 


gtcccaggga 


ggaacjccact 


18720 


tgcctagggc 


ccaggaattt 


tgcaagcctc 


ttgccctagg 


gaggaaggaa 


gggaggagga 


18780 


tcttaccttg 


aactgtcaag 


cctagagcct 


ggtggggcag 


gcagaaatgg 


gtgcagtcca 


18840 


tgagttagaa 


acactagagg 


agacactttg 


ctgcttggnc 


cggggcaggc 


aagnttaatt 


18900 


cccgaggctc 


ctgccactgc 


atctcaatct 


ggaaggtgac 


caggtggggc 


aggacccacg 


18960 


tctcccagat 


gactcatttt 


ttctagaaca ggggcttggc tgccaaagag gatacttgat 


19020 


ttcggcttgt 


ggggacagtg 


gtggacccag 


catctgggct 


ttatataaag ggcagctttg 


19080 


ttgccctgta 


aacacacaga 


ccatgggtgg 


ccacttcttc 


cagtaagtta 


gcuggggagt 


19140 


tggaagttta 


ggtaaaacct 


tttgattgac 


aaatgttggc 


gaattaccat 


gctgttaaat 


19200 


gaaacattgt 


tctgccaccc 


tggggctgtg 


ggtgcctgcg 


tgcaccctct 


gaaaaatcac 


19260 


acaggaagtg 


gggtggggtc 


tctgtgaagc 


tggtgtcccc 


cagcctcagg 


gatgctgcag 


19320 


aaatggaatg 


aggaccaaca 


gggactcaga 


tgtccaagga 


agctctacag 


cggagaggac 


19380 


ggcttgggaa 


ggaggtccag 


gcccaggtcc 


ctccggaacc 


caatgggtat 


ggggcagcct 


19440 


ggctcctgcc 


tcatccccct 


tctcctgttg attatgtcct 


cacagtgtat 


gccgccggca 


19500 


aagcagccac 


ctcaggtgtt 


cccagcattt 


atgcccccag 


cacctatgcc 


cacctgtctc 


19560 


ccgccaagac 


cccaccccca 


ccagctatga 


ttcccatggg 


ccctgcctac 


aacgggtacc 


19620 


ctggaggata 


ccctggagac 


gttgacagga 


gtagctcagg 


tgaggccggg 


ggaagcagga 


19680 


acagctggtg 


ggagtgtgct 


gggcatctgg 


acactgaggg 


gcaggggctg 


gaaggaagag 


19740 


tgtcttggga 


gccgaggagg 


ggctctgctc 


ctggtgcgcg 


gccactgaca 


gccactctcc 


19800 


cccagctggt 


ggccaaggct 


cctatgtacc 


cctgcttcgg 


gacacggaca 


gcagtgtggc 


19860 


ctctggtgag 


aatccatcgt 


cccgaagttg 


gatgtgcctg 


taagggagag 


gggtgggcca 


19920 


ggatccatcc 


tcccaaaccg 


accaccaccc 


ccctgtccct 


agaagtccgc 


agtggctaca 


19980 


ggattcaggc 


cagccagcag 


gacgactcca 


tgcgggtcct 


gtactacatg 


gagaaggagc 


20040 


tggccaactt 


cgacccttct 


cgacctggcc 


cccccagtgg 


ccgtgtggag 


cggggtaagc 


20100 


aggagccttg 


gggtctgagg 


gcttttaagg 


tgggggggtg 


aaacatgtct 


ccctgatacc 


20160 


tgccgcaggg 


actcttggtg 


caaaccctgg 


accccgggct 


cctccagcag 


tcagtgacac 


20220 


cccccttccc 


tgcagccatg 


agtgaagtca 


cctccctcca 


cgaggacgac 


tggcgatctc 


20280 


ggccttcccg 


gggccctgcc 


ctcaccccga 


tccgggatga 


ggagtggggt 


ggccactccc 


20340 


cccggagtcc 


caggggatgg 


gaccaggagc 


ccgccaggga 


gcaggcaggc 


gggggctggc 


20400 
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gggccaggcg 


gccccgggcc 


cgctccgtgg 


acgccctgga 


cgacctcacc 


ccgccgagca 


20460 


ccgccgagtc 


agggagcagg 


tctcccacga 


gtaatggtgg 


gaggagaagc 


cgggcctaca 


20520 


tgcccccgcg 


gagccgcagc 


cgggacgacc 


tctatgacca 


agacgactcg 


agggacttcc 


20580 


cacgctcccg 


ggacccccac 


tacgacgact 


tcaggtctcg 


ggagcgccct 


cctgccgacc 


20640 


ccaggtccca 


ccaccaccgt 


acccgggacc 


ctcgggacaa 


cggctccagg 


tccggggacc 


20700 


tcccctatga 


tgggcggcta 


ctggaggagg 


ctgtgaggaa 


gaaggggtcg 


gaggagagga 


20760 


ggagacccca 


caaggaggag 


gaggaagagg 


cctactaccc 


gcccgcgccg 


cccccgtact 


20820 


cggagaccga 


ctcgcaggcg 


tcccgagagc 


gcaggctcaa 


gaaggtgagg 


gccgccctcc 


20880 


ctggcgtcca 


gaccgtccct 


gggcccccag 


ccggtccccg 


cggctcatac 


ccttctttct 


20940 


ttctcccttg 


cagaacttgg 


ccctgagtcg 


ggaaagttta 


gtcgtctgat 


ctgacgtttt 


21000 


ctacgtagct 


tttgtatttt 


tttttttaat 


ttgaaggaac 


actgatgaag 


ccctgccata 


21060 


cccctcccga 


gtctaataaa 


acgtataatc 


acaagctctg 


gagagaacca 


tttgttcggc 


21120 


cgcgcggggc 


gggggaccgg 


ggctgctccc 


gtatgcgtct 


gtaaagcgcc 


gcgtcccggg 


21180 


ggcaccggag 


tccggggccg 


ggaggaagag 


acccagcctg 


gcccggcccg 


cgcccgcgcc 


21240 


gccggccgga 


gaacgtgccc 


cgcgcagcca 


ccgcccgcct 


gcgtgcgcgc 


cccggccccg 


21300 


cccaggcgtg 


cgcatgcgcc 


ccggccctcc 


gccttcgcgc 


accgcaggct 


ggccgccggg 


21360 


agcgcgcgcg 


cgctcctctc 


cccttccagc 


ccatcccccc 


cagcccccca 


ccgacctact 


21420 


ttactgtctc 


caaactcggg 


cagcccacct 


ggcccccgac 


gaccccagcc 


cctgctccgg 


21480 


gtaccccgac 


gttccatcca 


gacccgcgtt 


tcaccagggc 


ggcgcgcggc 


gacctcgcgc 


21540 


cccgcggagc 


cccgggctcg 
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cgcgcgcccg 


cccgcccccg 
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21600 


tcccgggccg 


cctcccccca 


gcgcgcgtcc 


gccccgggct 


cgcgccgccg 


ccgccgccgc 


21660 


cgccgcgcgc 


gcgcagctca 


agtaaaggag 


gaaaaaaaaa 


agggggaaaa 


atagaaagcg 


21720 


g 
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